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PREFACE 

The present volume on educational measurement is dom- 
inated by two main ideas, first, that the work in measurement 
should be handled more and more by the individual class- 
room teacher ; and second, that the chief purpose to be served 
by standard tests is the diagnosis of pupil ability and pupil 
difficulties. When standard tests were first devised and dur- 
ing the experimental stage, it was well to leave their adminis- 
tration largely in the hands of experts. But now that the 
value of such tests has been fully demonstrated, it is important 
that all teachers master the technique of scientific testing, 
and that courses in educational measurement be included as 
a necessary training for all teachers. Standard tests are a 
new and valuable educational tool or set of tools. No teacher 
is fully equipped who has not mastered their use. 

The value of standard tests for diagnostic purposes is now 
generally recognized. Diagnostic tests are replacing others 
not well adapted for such purposes. This means that the 
tests are used to locate pupil weaknesses in order that such 
weaknesses may be corrected. The individual child thus 
becomes the center and object of the work. It is not school 
systems as such but children that are important. That we 
have so quickly, in the use of standard tests, come to recognize 
that the child is the true center and the true object of consid- 
eration, is an indication that to-day as never before the spirit 
of progress and service is dominating and determining all 
educational effort. 

The purpose of this volume is not a critical evaluation of 
all the available tests on different subjects, but a treatment 
of those tests which on account of their use, purpose and adapt- 
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ability have been found to be most serviceable to the class- 
room teacher. To this end classroom results from the use 
of certain tests and the teachers' reactions to them expressed 
in their own words have been freely used. The work will 
serve as a handbook for the classroom teacher and also as a 
text for use in teacher training classes in high schools, normal 
schools, and colleges, or as a basis for reading circle work. 

The authors have drawn freely from the many available 
sources and they acknowledge the many courtesies, little and 
great, extended by authors of standard tests and by cooperat- 
ing teachers and superintendents. 

While the present work is issued as a completed volume, 
it is expected that revisions will be necessary, since standard 
tests have undoubtedly not reached their final development. 
It will be the purpose of the authors to keep the work up to 
date by as frequent revisions as may be necessary in order to 
provide teachers with the latest and best essential data on 
educational measurements. 

G. M. Wilson 
Kremer J. Hoke 

Ames, Iowa, 
January i, 1920. 
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HOW TO MEASURE 

CHAPTER I 

THE NEV ATTITXJDE TOWARD MEASUREMENT 

When Dr. J. M. Rice, a little more than twenty years ago, 
published his studies applying scientific measurement to the 
results of teaching, there was a storm of protest by educators 
from one end of the country to the other. It was apparent 
that the educational leaders of the country were not ready to 
follow Dr. Rice's lead. The present movement started with 
studies of a somewhat different nature, such as Thorndike's 
notable study on "The Elimination of Children from the 
School" and studies by Strayer and Elliott upon school 
costs. The application of scientific methods to these phases 
of education was received with mpre favor by educators, 
and the emphasis was gradually shifted to the measure- 
ment of subject matter through the use of scales and 
standardized tests. Thus, after two decades, Dr. Rice's 
viewpoint was accepted and his methods improved upon. 
' It was fortunate that the first scale for the measurement of 
subject matter was the one in handwriting. The value of 
this scale became apparent immediately, and by degrees 
standards for grade attainment in speed and quality were 
set up. These standards were first developed by practical 
school men through the actual use of the scale.^ 

^Wilson, G. M., "The Handwriting of School Children,'* Elementary School 
Teacher^ ii : June, 1911, pp. 540-543, and Freeman, Frank W., "Some Practical 
Studies of Hdrndwritlng" Elementary School Journal, 14 : Dec. 1913, pp. 167-179. 
Partly reported by J. F. Bobbitt in the Twelfth Yearbook of Uie National 
Society for the Study of Education, Part I, 1913, pp. 7-96. 

B I 



2 How to Measure 

Even so, when it was proposed at the Philadelphia meeting 
of the department of superintendence in 1913 that a com- 
mittee on school eflSciency be appointed, there was vigorous 
opposition. The proposal was merely for the appointment 
of a committee, yet a decision required a standing vote and 
carried by a majority of only one. The next year, at the Rich- 
mond meeting of the department of superintendence, it was 
surprising to note the change in sentiment. 

The growth that may take place with an individual 
in a single year is well illustrated by the remarks of Super- 
intendent Ben Bluett, of the St. Louis public schools. 
At the Philadelphia meeting, in his usual sincere and 
thorough way of thinking, he was very much disturbed 
that a group of young men should propose the measure- 
ment of "childhood," "mother love," and other intangible 
elements of the educative process. There was, in fact, 
never any intention of trying to measure these elements, 
but such terms were used by tile opposition, and it was Ben 
Bluett's impassioned appeal against such procedure which 
had much to do with the large vote against the proposal for 
the appointment of a ccfmmittee on measurement and school 
efficiency. A year later it was generally agreed that the 
feature of the Richmond meeting was Ben Bluett's confession. 
He had been made a member of the committee appointed at 
'Philadelphia. He had met with this committee, fifteen in 
number, several times during the year, and had studied the 
question earnestly with the other members of the committee. 
He had begun to realize the significance of the movement 
and had secured the cooperation of Dr. Withers of the St. 
Louis College for Teachers in applying some of the tests in 
the St. Louis schools. The loyal, sincere, whole-hearted 
manner in which Ben Bluett acknowledged his lack of under- 
standing of the movement a year before, and his thorough 
conversion to the advantages of the movement, swept away 
whatever opposition there may have been in the Richmond 
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meeting. From that time forward the progress of the move- 
ment has been only a question of ways and means, and 
better adaptation to secure the desired results. Even the 
school survey movement, that phase of the school efficiency 
movement which has been most feared by superintendents 
because of its frequent use by an opposition to discredit 
the work of the schools, has entered upon new life and 
has become an integral part of the American public school 
system. 

It must not be assumed, however, that the work in measure- 
ment in the public schools has been perfected. It has passed 
the first stages. Leaders are convinced. Useful scales and 
tests have been developed. The technique of formulating a 
test has been further perfected and the value of a scientific 
test is better understood. In some respects we have entered 
the second stage of measurement. We have come to the 
point of discriminating between good and bad tests. Already 
a few standardized tests have been discarded. 

We are now quite surely approaching a third stage of 
development, and that is the stage in which the tests shall 
be thoroughly weighed and judged as to the fundamental 
considerations of curricula making involved, whether they 
are or are not testing desirable school products, and whether 
their use will or will not lead to better methods of teaching 
and better selection of subject matter. In this stage the 
standard tests will be used more and more for the diagnosis 
of the weaknesses of individual pupils, more and more in 
testing the efficiency of methods of teaching. It is in this 
third stage that the rank and file of the teaching profession 
are necessarily involved. If the tests are to be of service, 
not merely as a general measure of the efficiency of a school 
system, but also of service to the teacher and for the pupils in 
the schoolroom, then it becomes necessary that the individual 
teacher shall master the details for actually using the tests 
in her own schoolroom. This is not too much to expect if a 
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man well beyond sixty, as was Superintendent Bluett, 
could approach this movement with an open mind and 
accept its benefits after a year of conscientious study. 

That teachers are interested and keen to master the 
accumulated knowledge with regard to measurement is more 
and more apparent. Hence this effort is made to bring to- 
gether the various contributions on the subject in form for 
use by the teacher. It is true, of course, that we shall make 
slow progress in educating the entire teaching profession 
imtil teachers become a trained body of professional educa- 
tors with permanent tenure. But for this it were unwise 
to wait. In the meantime, may we not expect that any 
one who has accepted the responsibilities of the teaching 
profession will consider that she owes it to herself and to her 
pupils to master the details of using scales and standard- 
ized tests for the measurement of subject matter? 



CHAPTER II 

THE MEASUREMENT OF SPELLING 

There are at present several spelling tests available. Before 
deciding on which one to select for use, it will be well to 
consider what should be tested in spelling.^ 

It appears that a person needs to spell only when he writes. 
People are therefore good spellers, for all social purposes, 
when they spell correctly the words which they use in their 
written work, such as writing letters, articles, club papers, 
compositions, school exercises, business notes, and the like. 
Manifestly the words used under such circumstances are the 
foimdation words of the EngUsh language. The first require- 
ment of a test in spelling, therefore, is that it be based upon 
the common fimdamental words of the EngUsh language. 

What to Test. — Much progress has been made in deter- 
mining the fundamental words in the English language. 
Dr. W. Franklin Jones, at the University of South Dakota, 
studied the writing vocabulary of grade pupils by analyzing 
the words in the composition work of 1050 pupils residing in 
four different states. The work was so managed as to lead 
pupils to cover all the various fields of experience, and so 
exhaust the words in their several vocabularies. The pupils 
continued to write until new words ceased to appear in their 
compositions. In all, 75,000 themes were secured, consisting 

^ The teacher who waints further help on the value of measurement in edu- 
cation should take time to read Chap. XII before proceeding with the present 
chapter. The teacher unfamiliar with statistical terms will need to consult 
Chap. XI as terms occur in this and succeeding chapters. For the practical 
Mses of the spelling tests, see the last section of this chapter, beginning on 
page 19. 

5 
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of a total of 15,000,000 words. Dr. Jones spent eight years 
collecting and scoring these data. When completed, it was 
found that a total of only 4532 different words had been 
used by all these pupils. The largest single vocabulary 
consisted of 2812 words, the vocabulary of an eighth grade 
girl. The result of this study was to give a list of words 
which accurately represents the fundamental words used by 
school children. Apparently, it contains also the funda- 
mental words of the EngUsh language. 

Other studies have been made. One of similar character, 
which has led to the formation of a spelling scale, was 
conducted by Dr. Leonard P. Ayres. Dr. Ayres examined 
a total of 368,000 words written by 2500 different persons. 
This was a summary of previous studies. The first of these 
studies included in all about 100,000 words taken from 
standard literary selections. The second was an analysis of 
250 different articles which appeared in four Sunday news- 
papers published in Buffalo. The third consisted of the 
tabulations of 23,629 words from 2000 short business letters. 
The fourth consisted of some 200,000 words taken from the 
family correspondence of 13 adults. 

The Ayres study has the advantage of being based upon 
the words used by adults, and if we assume that the schools 
must prepare for active social participation on the adult 
level, then certainly Dr. Ayres' study would be above criticism 
from the standpoint of determining the fundamental words 
of the English language in common use for writing purposes. 
The Jones study and the Ayres study are in complete agree- 
ment as to the simplicity and small compass of the writing 
vocabulary. 

Any adequate test must be based upon the words of the 
language that are in common use and fundamental in written 
work. 

The Ajrres Scale. — In undertaking to form a scale for 
testing the spelling of school pupils, the first thing which 
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Dr. Ayres did was to determine the words which were most 
fundamental. The 368,000 words of his study were made up 
largely of repetitions. Fifty different words were repeated 
so frequently that they made up approximately half of the 
entire Ust. Dr. Ayres had fixed upon 1000 words as the 
number which he should select. In order to get the 1000 
words, he finally took all words which had been repeated as 
many as 44 times in the entire study. 

The next step was to arrange the different words according 
to difficulty, in order to secure a graded test, or, in other 
words, a spelling scale. To determine the relative difficulty of 
the words in the 1000 Ust, Dr. Ayres arranged to have the 
words spelled by school pupils. Fifty Usts of 20 words each 
were constructed, and the words included in these lists were 
pronounced to the pupils of the various grades in the middle 
of the school year in the schools of 84 cities scattered through- 
out the United States. The data secured from these tests 
gave a total of 1,400,000 spellings by 70,000 school children. 
On the basis of these data, the 1000 words were divided into 
26 groups according to difficulty. This will be understood 
by reference to the scale. (See scale inserted herewith.) 

Group " A " consists of " me " and " do," and these 
words were spelled by 99% of the second grade pupils. At 
the other extreme, Group " Z," consisting of " judgment,'' 
" recommend," and " allege " were spelled by only 50% of 
the eighth grade pupils. The scale is simple, and easily 
understood. At the top of each column is shown the average 
per cent of the words spelled by each grade, except that 
report is not made upon any grade for per cents below 50- 
The blank spaces to the left, however, if filled in, would 
indicate in each case 100%, — that is to say, the eighth grade 
pupils spelled all of the words correctly from columns " A " 
to " N " inclusive. 

Giving a Test. — A good test should be so difficult that 
no pupil in the grade will make a perfect score, and sufficiently 
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easy that most pupils in the grade will secure a fairly satis- 
factory score. In selecting words, therefore, to test the 
spelling abiUty of a particular grade, it would be well to 
choose the words spelled correctly by about 70% of the 
children of that grade. If pupils in the third grade were 
being tested, the best test would result from the use of words 
selected from column " L.'' A test, in order to be vaUd for 
individual pupils as well as for the group, should consist of 
at least 20 words. A smaller number of words would be 
equally vaUd for an entire school system, but the teacher 
will desire to know the standing of individual pupils, and so 
will need to use 20 words for the test. If 40 words were used, 
the results would be more reliable for individuals. 

The tabulations of the scale are based upon tests given by 
the column method. This is the usual method of dictating 
words for pupils to spell by writing in columns. The Cleveland 
Survey shows that the returns from testing by this method 
differ very Uttle from returns secured when the words are 
used in context. Other studies show that the contextual 
method (including words in complete sentences, the entire 
sentence being written) gives a slightly lower score. It is 
recommended, therefore, that teachers test by the column 
method. All that is necessary is that the pupils be given 
sufficient time to write a word before proceeding to the next 
word. The teacher should also be accommodating in re- 
pronouncing a word wh^n necessary, in order to have it 
understood. Pronounce the words clearly, but do not sound 
them phonetically, or inflect them so as to aid the pupils in 
spelling. Give the meaning of words that sound like words 
with a different meaning and spelling. In case of difficulty 
in understanding a word, the best way to explain it is to use 
it in a simple sentence. 

Scoring the Papers. — If there were 30 pupils in the third 
grade class above referred to, that would give a total of 600 
spellings. Suppose that of these 600 spelUngs, 480 were 
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correct. Then 80% of the words were correctly spelled. 
Referring now to column " L " of the scale, it will be observed 
that the class, as a whole, is 7% above the standard of third 
grade pupils in the 84 cities which formed the basis for the 
scale. They are at the same time 8% below the standard 
for fourth grade pupils. Suppose that a particular child in 
the grade has spelled 17 words out of the 20, — that would 
mean a grade of 85%. This is better than the class average 
and only a Uttle below the standard for the fourth grade. 
In the same way, the standing of each pupil in the grade may 
be determined. 

In order to see at a glance the condition of her class, the 
teacher will find it worth while to arrange the scores for her 
grades in a distribution somewhat as follows : 

Table i. — Distributed Spelling Scores for 30 Thirj) Grade 

Pupils. Standard 73 



Grade : 



^Aaa^cC 



Date : 



Score . . . 


40 


45 


50 


55 


60 


65 


70 


75 


80 


85 


90 


95 


No. of pupils 








I 


I 


2 


4 


5 


7 





3 


I 



100 



This table means that one pupil made a score of S5, one 
a score of 60, two a score of 65, four a score of 70, etc. 
This distribution emphasizes the needs of particular pupils. 
If the teacher of this particular third grade class can, by special 
work with the one pupil at 55, the one at 60, the two at 65, 
and the four at 70, bring these pupils up to the grade's stand- 
ard, she will have a very satisfactory situation. 

One of the advantages of the Ayres spelling scale is its 
simpUcity and the ease with which it can be used. Because 
it contains the fundamental words of the language and the 
words on which the pupil should place his attention, the 
changes which it effects in the character of the spelling work 
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will be entirely in the right direction. To the extent that it 
does thus direct the attention to the proper kinds of words, 
we may expect that scores in particular cities will rapidly 
become higher than those indicated on the Ayres scale. 
This fact is indicated by the returns from the use of the 
Ayres scale in Boston, after considerable attention had been 
given by the teachers to the proper selection of word lists. 
Dr. Ayres himself has recognized this possible Umitation, 
closing the discussion of his spelling scale with the following 
words : 

" In all such testing, it must be remembered that the present 
scale or any scale for measuring spelling attainment will become 
increasingly and rapidly less reliable for measuring purposes as the 
children become more accustomed to spelling these particular 
words. In proportion as these lists are used for the purposes of 
classroom drill, the scale will become untrustworthy as a measur- 
ing instnmient. Probably the scale wiU have served its greatest 
usefulness in any locality when the school children have mastered 
these looo words so thoroughly that the scale has become quite 
useless as a measuring instnmient." 

Other Tests. — While it is recommended that the grade 
teacher use only the Ayres scale in testing her pupils as in- 
dividuals and her room for comparison with other rooms 
within the city or elsewhere, many teachers, and especially 
superintendents, will desire at least some information con- 
cerning other lists which have been used as spelling tests. 
The most notable of these are the Buckingham extension of 
the Ayres scale, the Iowa Spelling scale, the Buckingham 
scale, the Rice test, the Starch test, the Courtis Spelling 
test, the Boston Minimum Ust, and Jones' One Hundred 
Demons. 

Buckingham's Extension of the Ayres Scale, — Dr. Bucking- 
ham's extension of the Ayres scale (first available in 191 9)* 
consists of the addition of 505 words chosen on the basis of 
agreements among spelling books. The words are added, for 
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the most part, to the upper end of the Ayres scale. This 
mcreases the number of words in the columns at the upper 
end of the scale and also extends the scale six steps to the 
right. The added words are not offered as constituting a 
fundamental vocabulary in the same sense as were the 
original looo words selected by Ayres. In using this ex- 
tension, therefore, teachers should keep in mind that the 
added words have less value from the standpoint of social 
utiUty than the looo original words of the scale. The addition 
of these words, however, makes it possible to use the scale 
more extensively in upper grades and high school. It should 
be of particular value in testing the spelling efficiency of the 
pupils in the high school who are specializing in commercial 
studies. 

The Iowa Spelling Scale, -r This scale includes 2977 words 
from the written correspondence of Iowa people. Accuracy 
of each word was determined on the basis of 200 or more 
spelhngs by children in each grade. Thus, more than 650,000 
spellings were used in each grade, or a total of nearly 4,750,000 
in the seven grades. In all essential features the scale is an 
imitation of the Ayres scale. The placing of the words is 
determined in practically the same manner and the form of 
the scale is similar. It has decided value, however, as 
showing the possibility of basing the spelling work directly 
upon the words of a particular section of the country. The 
scale is published in three parts in order to reduce the error 
in the placement of words. Part i is a scale for grades 2, 3, 
and 4; part 2, a scale for grades 4, 5, and 6; and part 3, a 
scale for grades 6, 7, and 8. The large increase in the number 
of words makes the scale particularly valuable for individual 
testing. 

The Buckingham Scale. — The work of Dr. Buckingham in 
evaluating a list of 50 words, has to date proved of value 
chiefly in calling attention to the importance of the propei 
selection of word Usts, the difference in the difficulty of words, 
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and the methods to be used in the further study of words for 
spelling lists. The scale first appeared in 1913, and apparently 
has not come into general use in school testing and school 
survey work. The Ayres scale, which made its appearance 
a Uttle later, is so convenient and so satisfactory that it has 
been extensively used by superintendents, bureaus of efficiency, 
and survey committees. 

The fifty words resulting from the Buckingham study are 
given herewith, in the order of their difficulty. These words 
vary in difficulty by even distances, so that the scale, as it 
appears, is a step scale. Theoretically it should be used in 
such a way as to determine how far up the scale a pupil can 
spell successfully. It can be used in grades three to eight. 

Dr. Buckingham, in deriving the scale, pronounced all of 
the words to the children in contextual form. In view of other 
studies which have been made, it appears that they could be 
used in column form with results slightly varying and equally 
satisfactory for comparative purposes. Although not in 
general use, the scale is mentioned because of the high quaUty 
of the scientific work involved in its formation. It has not 
been evaluated in terms of grade achievement. However, 
Dr. Buckingham is working on an extension of his scale. 
In time, he expects to extend it to include 1000 words and 
evaluate it in terms of grade achievement. 

Buckingham's Fifty Words Arr.\nged in Order of 

Difficulty 

17. cousin 

18. beautiful 

19. touch 

20. freeze 

21. forty 

22. instead 

23. wear 

24. tailor 



I. 


only 


9- 


pretty 


2. 


even 


10. 


nails 


3- 


smoke 


II. 


butcher 


4- 


chicken 


12. 


Tuesday 


S- 


front 


13- 


sure 


6. 


another 


14. 


answer 


7. 


lesson 


IS- 


nor 


8. 


bought 


16. 


raise 
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25. trying 

26. minute 

27. pear 

28. towel 

29. tobacco 

30. whole 
3i« button 

32. janitor 

33. quarrel 



34. against 

35. circus 

36. sword 

37. whistle 

38. stopping 

39. carriage 

40. guess 

41. telephone 

42. choose 



43. telegram 

44. saucer 

45. saucy 

46. already 

47. pigeons 

48. beginning 

49. grease 

50. too 



The Rice Test. — It was Dr. J. M. Rice, in his Forum 
articles of 1897, who first began the work of attempting a 
definite measurement of spelling. He gave three different 
tests, the number of children examined reaching nearly 
33,000. The first test consisted of 50 words pronounced by 
the teachers for written spelling in the usual manner. The 
words used in this test were the following : 



furniture 


beggar 


breakfast 


Missouri 


chandelier 


pliunber 


chocolate 


Alleghenies 


curtain 


superintendent 


cabbage 


independent 


bureau 


engine 


dough 


confectionery 


bedstead 


conductor 


biscuit 


different 


ceiling 


brakeman 


celery 


addition 


cellar 


baggage 


vegetable 


division 


entrance 


machinery 


scholar 


arithmetic 


building 


Tuesday 


geography 


decimal 


tailor 


Wednesday 


strait 


lead 


doctor 


Saturday 


Chicago 


steel 


physician 


February 


Mississippi 


pigeon 


musician 


autumn 







Dr. Rice had some question as to the value of word lists 
for spelling work, recognizing that spelling was useful only 
as a means for recording or communicating thoughts. This 
is the same point which we now recognize in different form ; 
viz. that only the written vocabulary needs to be mastered 
for spelling purposes. 
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In line with this thought, Dr. Rice gave a second test to 
more than 13,000 children. This test contained 50 words 
placed in composition form. The following sentences were 
used, the underscored words forming the basis of the test : 

"While running he slipped. I listened to his queer speech, but 
I did not believe any of it. The weather is changeable. His loud 
whistling frightened me. He is always changing his mind. His 
chain was loose. She was baking cake. I have a piece of it. 
Did you receive my letter? I heard the laughter in the distance. 
Why did you choose that strange picture? *Because I thought I 
liked it. It is my purpose to learn. Did you lose your almanac? 
I gave it to my neighbor. *I was writing in my language book. 
Some children are not careful enough. Was it necessary to keep 
me waiting so long? Do not disappoint me so often. I have 
covered the mixture. He is getting better. *A feather is light. 
Do not deceive me. I am driving a new horse. *Is the surface 
of your desk rough or smooth? The children were hopping. 
This is certainly true. I was very grateful for my elegant present. 
If we have patience we shall succeed. He met with a severe 
accident. Sometimes children are not sensible. You had no 
business to answer him. You are not sweeping properly. Your 
reading shows improvement. The ride was very fatiguing. I am 
very anxious to hear the news. I appreciate your kindness, I 
assure you. I cannot imagine a more peculiar character. I 
guarantee the book will meet with your approval. Intelligent 
persons learn by experience. The peach is delicious. I realize 
the importance of the occasion. Every rule has exceptions. He 
is thoroughly conscientious; therefore I do trust him. The 
elevator is ascending. Too much praise is not wholesome." 

(The fourth and fifth year test ends with: "This is certainly 
true." The higher test includes all the sentences except the four 
marked with an asterisk.) 

The third test given by Dr. Rice, and the one which he 
considered really more valid than any of the others, was a 
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composition test based upon a picture and a story told by 
the teacher. This test was valuable particularly in that it 
required pupils to choose their own words and to spell them. 
The resxilts on the third test are not tabulated by Dr. Rice, 
but we do have some tabulations on the first and second 
tests and the averages are given herewith. 



Table 2. — Rice Tests — 1895 



Grade 


Average First Test 


Average Second Test 


(Column List) 


(Context) 


4 


53.5 


64.2 


5 


64.3 


75.1 


6 


75.6 


70.4 


7 


81. 


78.8 


8 


84.2 


84.4 



Since Dr. Rice gave these tests to a sufficiently large number 
of pupils, the teacher may accept the averages given above 
as norms or standards of performance, and by comparison 
with them may determine the spelling ability of her own pupils. 

The chief objection to the Rice list is that the words are 
not evaluated, and do not form a scientifically constructed 
scale. The words are given imiform values, but are far from 
being uniform in difficulty. In the Ayres and Buckingham 
scales, the words are assigned values according to difficulty. 
In the Rice test, a pupil gets as much credit for speUing an 
easy word as he does for spelling a difficult word. 

The Starch Test, — Any one making use of the Starch test 
in spelling will do it with quite different purposes in mind 
than those for which he uses the Ayres scale. The words were 
secured by taking the first defined word on the even-numbered 
pages of the 1910 edition of the New International Dictionary. 
Proper names, technical words, and obsolete words were dis- 
carded from the list. The list, thus reduced to 600 words, 
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was arranged alphabetically according to the size of the words. 
These were then divided into six lists of loo words each by 
assigning words in turn to the six lists. A test is made by 
using one of these six lists, which are assumed to be of equal 
difficulty as lists. 

By using words selected at random from the entire EngUsh 
language, Starch proposes to test general spelUng abiUty, 
and his tests will be found to be of service in the grammar 
and high school grades, provided the test is not permitted in 
turn to exercise an influence upon the teacher in determining 
the materials of the spelling lessons. The influence of the 
Starch test is surely in the direction of the old " spelling grind " 
described by Rice. The Starch lists contain such words as 
the following : 

nunciature conterminous anthropometric 

quarantinable photosphere imperturbation 

Such words are manifestly not suitable for use with grade 
pupils. 

The Boston Minimum List, — The Boston School Document 
No. 8, 1914, contains a minimum speUing Ust of 840 words. 
They are well selected, and similar in many respects to the 
Ayres list. However, they have not been evaluated for use 
as a standard test. The document containing this list, and a 
supplementary list of 2525 words, is no longer available 
except in libraries of departments of education. It is of 
interest chiefly in showing the tendency to get away from 
the old type of speller which contained 10,000 to 15,000 
words, selected with Uttle regard for use. The CaUfornia 
list ^ is similar to the Boston list and is constructed along 
similar lines. It is of value for curriculum making in spelling, 
but not for testing. 

Jones^ One Hundred Demons, — Dr. Jones has given a list 
of the 100 words most often misspelled by pupils in written 

^ Bulletin No. 7, Chico State Normal, Chico, California. 
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work, as shown by his study involvmg the tabulation of 
15,000,000 words. This list he has designated as the " speUing 
demons." The list has been widely used for testing, but to 
date it has not been sufficiently evaluated in terms of grade 
standards, although Dr. Jones promises such evaluation in 
the near future. The Ust appeals to children because of its 
simplicity, and its known difficulty. If a pupil thoroughly 
masters this list of " demons " he will very probably correct 
the spelling of most of the words which he has been mis- 
spelling. Dr. Jones did not find any pupil among the 1050 
who missed as many as 100 words, 87 being the largest list 
for any one pupil. 

The list of " spelling demons," together with their relative 
difficulty as shown by preliminary tests which Dr. Jones has 
summarized, follows herewith: 



Frequency of 

which 321 
their 316 
there 296 
separate 283 
hear 280 
here 278 
said 275 
been 273 
says 273 
they 271 
some 270 
any 268 
Wednesday 266 
done 263 
know 263 
read ("red") 261 
piece 260 
don't 258 
break 257 
tear 255 



Misspelling of the Jones' 100 Demons 



meant 247 
just 24s 
many 245 
too 243 
Tuesday 242 
knew 237 
lose 236 
week 235 
can't 234 
grammar 234 
whole 231 
wear 230 
every 228 
instead 228 
built 225 
blue 224 
shoes 224 
won't 221 
wrote 220 
cough 217 



minute 210 
busy 209 
two 208 
much 206 
enough 206 
seems 205 
none 203 
does 203 
easy 202 
would 200 
whether 200 
loose 198 
could 196 
ready 196 
beginning 195 
heard 195 
country 194 
business 194 
ache 192 
answer 191 



often 185 
writing 184 
doctor 182 

• 

very 182 
though 181 
among 179 
sure 179 
tonight 174 
forty 172 
since 172 
once 170 
raise 169 
trouble 168 
choose 168 
color 167 
dear 166 
truly 166 
early 166 
used 165 
friend 164 
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February 255 
laid 252 
straight 251 
through 250 
half 250 



where 216 
write 216 
buy 212 
believe 212 
coming 212 



making 190 
always 188 
hour 187 
tired 187 
sugar 185 



again 164 
hoarse 162 
guess 162 
women 161 
having 158 



The PupiVs Own List of Misspelled Words, — The final 
test of spelling is a gradual decrease in the pupil's own list of 
misspelled words. A necessary precaution in this connection 
is that pupils should not consciously avoid good words because 
they do not know how to spell them. They should be taught 
to use the dictionary instead of replacing good words by 
simpler words which they are able to spell. If every child 
is told to keep a list of his own misspelled words and to 
build up a spelling consciousness with the aid of the dic- 
tionary, and if he is urged constantly to extend his vocabulary 
and to study the choice of words in order to get appropriate 
and accurate expression, a pupil's spelling in regular written 
work may be considered as the best and the final test of spelling. 

At stated intervals, a pupil should be encouraged to go over 
8*or 10 pages of his written material and determine carefully 
the number of misspelled words. The teacher can help the 
child in doing this. But for the teacher to do it without 
the child's help has been in general the mistake of the past. 
In proportion as the number of misspelled words decreases, 
the child is improving in spelling. 

While this test is not scientific, we can conceive of teachers 
making it even more valuable than scientific tests as they are 
frequently used. We do know that the time which a pupil 
spends upon his own list of misspelled words involves no 
lost effort; and that his spelling improves in the same pro- 
portion that this list is reduced. Indiscriminate drill in 
spelling, as indicated in the Butte, Montana, survey, must 
be replaced by attention to the needs of individual pupils. 
There were 278 of the Butte children, or over 18% of the 
total, who made scores of less than 60%, although the total 



The Measurement of Spelling 19 

score for the city was 10.3% above the Ayres standard. 
Much time had been spent upon indiscriminate drill. 

The Practical Uses of a Spelling Scale. — Teachers will 
find a spelling scale of very great use in their regular school 
work, aside from any supervisory use which the superintendent 
may make of the tests given. Tests administered under uni- 
form conditions and with a scientifically constructed scale per- 
mit the teacher to compare one class with another very accu- 
rately. If the fourth grade teachers in a city system would 
agree among themselves to give a test on a certain day, they 
could then come together after the papers had been scored and 
find out, first of all, which room was doing the best work. 
This would be shown not only by the median score, but also 
by the total distribution which shows the number of pupils 
at lower as well as at higher levels. 

After the teachers have agreed that a certain one of the 
fourth grade rooms has made, all told, the best score in the 
test, a second question naturally arises; namely, what 
method was used in securing these results with your children? 
This question suggests the second use which the teacher may 
make of the scale. She can test out different methods in her 
own room, or the particular group of fourth grade teachers to 
which we have referred may separate their rooms into groups 
of approximately equal ability and assign different methods 
for different groups. Then, at the close of a given period, — 
one, two, three, or six months, — they may again give a test 
and so determine which methods are most effective. If the 
teachers have been wise they have determined in great detail 
how the methods were to be applied and the amount of time 
to be devoted to the spelling work, so that the one thing 
which is upon trial is the method of presenting the work; such, 
for instance, as the column method, the contextual method, 
the method of studying at home or in the seat and then testing 
in class, the method of teaching in class with very little 
testing, and various other methods. 
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The above paragraph suggests a third point which teachers 
may try out by the use of a scientific scale ; namely, the 
amount of time which can profitably be devoted to spelling. 
Dr. Rice, in his discussion of the spelling grind in 1897, showed 
that the time element had very little to do with results. We 
now know that this was because of the character of the spelling 
lists. When the words used in the spelling work with children 
are unintelligible to them, the results will be poor, regardless 
of the methods and the time devoted to the work. But if 
we assume words with correct social values, then the Ayres 
scale may properly be used for determining the amount of time 
which can be spent upon the spelling work with greatest profit. 

A fourth use of the spelling scale has been suggested in 
asking the teacher to make a distribution of the grades. This 
use is to locate the spelling ability of individual children. 
By doing this, the teacher will probably find in her classes a 
small number of pupils who spell so well that it is unnecessary 
to require them to submit to any regular spelling drill. If 
such pupils are excused from spelling drill, being told merely 
to attend to their own misspelled words and to use the dic- 
tionary when in doubt, and if the teacher finds in future testing 
that these pupils do not lower their scores, then she may 
feel that she has saved their time for other more valuable 
work without detriment to them, so far as spelling is con- 
cerned. At the other end of the scale, however, will be 
pupils who spell very poorly, and it is only by use of the 
scale that these pupils can be located with any degree of 
accuracy. Taking these pupils as individuals, or as groups 
according to their several needs, the teacher can work in a 
definite manner, giving additional time to some pupils without 
boring others, and really follow out the injunction of William 
Hawley Smith to " put the oil where the squeak is.'' It is 
quite probable that this result of the use of the scale in 
spelling, as in writing, will in time become one of its most 
valuable contributions. 
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Some pupils will make low scores in their spelling work 
because of the lack of general intelligence ; others, because of 
the lack of an adequate vocabulary, which can come only 
from reading ; others because their attention has never been 
directed to the difficulties of words, etc., etc. The teacher 
will know that she is working at the problem in a definite 
manner, and that she is working only with the pupils who 
need attention. This she has known more or less before in a 
general way, but the use of a scientific scale permits her to 
know it beyond peradventure of a doubt. 

It is not the purpose of the present work to discuss methods 
of spQlling. The teacher is directed to other works dealing 
specifically with this problem.^ The teacher will do well, 
however, to make her spelling work as specific as possible, 
both as to words and pupils. Many words spell themselves 
and require no attention, others are very difficult for large 
numbers of pupils. It is not only necessary to locate the 
words, but to analyze each word to see in what the difficulty 
consists. In short, drill which is general and bUnd must 
become specific and intelUgent. 

The discussion throughout has directed the attention of the 
teacher to the Ayres scale. Some teachers may properly 
ask if other scales may not at times be used to advantage. 
There will be no harm done in using other scales and the 
teacher may learn to use the Buckingham scale very effec- 
tively. The Jones' "Demons" have the advantage of being 
the words most frequently missed by school pupils. They are 
common words, and it is safe to assume that every pupil in 
the upper grades should study these words until he can spell 
the entire Ust without a mistake. In general, however, the 
Ayres scale is the one to use, for reasons which have been 
previously stated. There is this caution only, and that has 

* Freeman, Frank N., "The Psychology of the Common Branches," Houghton 
Mifl3in Company; Suzzallo, Henry, "The Teaching of Spelling," Houghton 
Mifflin Company; Cook and O'Shea^ "The Child and His Spelling," Bobbs- 
Merrill Company, Indianapolis. 
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been anticipated by Ayres himself ; namely, that as the scale 
is used more and more with the same pupils, a teacher should 
expect that gradually the scores will become higher. This^ 
however, is quite satisfactory, since the words are of the right 
kind and since, by using the scale, the pupil's attention has 
been turned from unfamiliar, useless dictionary words to the 
words which he will use in his own work. 



BIBLIOGRAPHY 

1. Ayres, Leonard P., "The Spelling Vocabularies of Personal and 

Business Letters," Division of Education, Russell Sage Founda- 
tion. New York City. 

2. Ayres, Leonard P., "A Measuring Scale for Ability in Spelling," 

Division of Education, Russell Sage Foundation, New York City. 

3. Buckingham, B. R., "Spelling Ability. Its Measurement and 

Distribution," Teachers College, Colimibia University, New York 
City. 

4. Des Moines Annual School Report, 191 5. Section on "Spelling." 

5. Jones, W. Franklin, "Concrete Examination of the Material of 

English Spelling," University of South Dakota, Vermilion, South 
Dakota. 

6. Pryor, Hugh Clark, "A Suggested Minimal Spelling List," Chap. 

V, Part I, Sixteenth Yearbook of the National Society for the 
Study of Education. 

7. Rice, J. M., "The Futility of the Spelling Grind," Forum, Vol. 23, 

pp. 163, 409. 

8. Studley, C. K., and Ware, Allison, "Common Essentials in Spell- 

ing," Bulletin No. 7, State Normal School, Chico, California. 

9. Wallin, J. E. W., "Spelling Efficiency in Relation to Age, Grade, 

and Sex, and the Question of Transfer," Warwick and York, 
Baltimore, Maryland. 

10. The teacher or supervisor who is interested in the more intricate 

problems of establishing a spelling standard is referred to the 
following recent articles: Ballou, School and Society, 1917, Vol. 
5, pp. 267-270; Ballou, Educational Administration and Super- 
vision, 1915, Vol. I, pp. 469-472; Kallom, Educational Adminis- 
1>ration and Supervision, 1917, Vol. 3, pp. 539-542. 

11. Ashbaugh, Ernest J., "Iowa Spelling Scale," Extension Bulletin, 

Nos. 43, 54, and 55, University of Iowa. 



CHAPTER III 

THE MEASUREMENT 0¥ HANDWRITING 

The writing supervisor had given Wilbur a grade of 95. 
Wilbur was dissatisfied. When the supervisor next came to 
the building, Wilbur made known his dissatisfaction, and 
asked why his grade was not higher. The supervisor answered 
that 95% was a good grade, that she never gave 100%, and 
that there was opportunity for him to further improve his 
work. Wilbur answered that he had received 95% from the 
fourth grade up, and he knew that he was writing much 
better than in any previous grade. The supervisor had no 
conclusive or satisfactory argument. She resorted to her 
authority as teacher, and left Wilbur still dissatisfied. What 
teacher has not had a similar experience with reference to the 
grade in writmg? 

This situation is rapidly changing in the public schools. 
Writing can be definitely measured, and the ratings can be 
made so accurately that the pupils themselves fully under- 
stand and appreciate that exact justice has been done. This 
has been brought about by the development of scales for the 
measurement of handwriting. 

If a teacher has not been accustomed to make use of 
scales and standardized tests in her work of grading, she 
would do well to begin with the subject of writing. Writing 
is one of the mechanical subjects and one of the most easily 
and quickly measured. In order to avoid confusion on her 
part, she shoul4 study and practice scientific measurement in 
this subject alofte until she has become reasonably proficient. 
It will be well for the teacher to read through a large num- 
ber of the works mentioned in the bibliography at the close 
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of this chapter, and as a beginning in this work, particular 
attention is called to numbers i and 2. 

The first scale in handwriting was developed by Dr. E. L. 
Thorndike, of Teachers College. It is based upon general 
merit in handwriting as determined by the judgment of a 
large number of competent graders. Thomdike's scale is 
widely used at the present time, and many think that it 
gives more satisfactory results than any other. It had, 
originally, the disadvantage ^ of being mechanically in- 
convenient, and for that reason the Ayres scale has become 
much more widely used. 

The Ayres scale consists of twenty-four samples of writing, 
eight each of vertical, semi-slant, and full slant style. The 
scale is arranged on a heavy sheet of paper 9" high and 36" 
wide, in the form of the following diagram : 
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It is so convenient in form that it may be placed in the 
schoolroom, where pupils may compare their handwriting 
with it at any time. This is desirable, and it is recommended 
that every schoolroom in which there are intermediate and 
upper grade pupils should have a copy of the Ayres scale 
available for pupils as well as for teachers. (See pp. 28-35.) 

What to Measure. — Ordinarily the teacher will measure 
only two elements in handwriting ; namely, speed and quality. 
By speed is meant the number of letters written per minute. 
By quality is meant general merit, or what tMe teacher indi- 
cates when she gives a grade in writing. Speed is determined 

* A defect since remedied in large measure. 
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by simply counting the number of letters written during a 
given time and reducing to the one-minute basis. It is 
quality or general merit which is measured by the use of the 
writing scale. These terms are relatively simple, and their 
significance will appear during the further discussion. It 
is just as well for the teacher to begin by giving a regular 
test, and in this manner to apply herself to the work of master- 
ing the details of grading and evaluating papers in handwriting. 
Giving the Test. — In order to make the test valid for 
comparative purposes, uniform conditions must prevail. 
The rules of the game are simple, and the teacher should 
follow them carefully, since it is only in this way that valuable 
comparison will be made possible. The directions for tests 
in handwriting are so generally standardized at the present 
time that comparison is possible, not only within the class, 
but one room with another and even one school system with 
another. The invariable aim is to secure results in such form 
as to make them easily comparable with like results obtained 
elsewhere. The rules are as follows : 

1. The copy must be simple enough for second grade 
pupils. While it is not necessary to use the same copy each 
time, it should be similar in difficulty. A copy which has 
been much used is the hne : " Mary had a Uttle lamb." 
Others have used the entire first stanza of this selection. 
Another copy which has been used is " Sing a song of sixpence, 
a pocket full of rye." The idea is to have a simple, easily 
xmderstood copy, which will not deter the pupil in his speed 
test. Some tests have been given with copy which was too 
difficult, making the results in speed unsatisfactory for com- 
parative purposes. 

2. Before the test is given, the copy should be memorized 
by all of the pupils. The purpose of the test is to determine 
speed and quality of handwriting. If the pupil must stop 
and think, he falls behind in speed. In one survey a rather 
difficult copy was placed in the hands of the pupils. They 
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were instructed to write the copy, repeating the same during 
the period of the test. The results were so unsatisfactory 
that speed was not reported upon by the survey committee. 
In addition to having the copy committed, it is a good plan 
to place the same upon the blackboard at several different 
places, so that any pupil who does happen to forget for a 
moment may reassure himself by a glance at the copy. 

3. The time for the test should be exactly two minutes. 
In order to make sure that all pupils start together, it is well 
to rehearse the details before actually starting the test. This 
makes sure that all pupils understand, clears away any con- 
fusion, and so secures the test papers in reliable form. 

4. Everything should be in readiness for the test before the 
pupils begin. This means that every pupil must have paper, 
a good pen, ink, and the copy committed. In order to make 
sure that all have pens, it is well to ask every pupil in the 
room to hold up the pen (or pencil, if used in second or third 
grade). Since the teacher will want to use the results of the 
test for the benefit of individual pupils, it is well at this point 
to place certain items at the head of the paper. The usual 
items are — name, grade, building, city, and date. If for 
any reason it is desired to make the test impersonal, these 
items may be omitted, or placed on a separate card with a 
number scheme as a key. 

5. When all is ready, the teacher gives some simple direc- 
tions. " Write as well as you can at your usual speed, using 
the following copy : * Mary had a little lamb.' Write the 
copy again and again until I say * stop.' At the command, 
stop at once, even if in the middle of a letter." After this 
explanation has been given the teacher says, " All in position. 
Dip the pens. Pens up. Begin." 

6. In exactly two minutes, pupils should be given the 
order to stop, and required to place their pens on the desk. 

7. At this point the teacher may save herself considerable 
work by having the pupils count the number of letters in the 
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copy. It is suggested that pupils place this number below 
the copy to the right, using pencil for the same, and then 
divide the number by two, thus reducing the score to a one- 
minute basis, as 2)146. The papers may then be collected 

73 
in the usual manner. 

Scoring for Speed. — The speed is calculated in terms of the 
number of letters written per minute. The test is given over 
a two-minute period in order to reduce the error. Some 
examiners have used other units, as three or four minutes, but 
evidence is not at hand that the results have been improved. 
In the first report upon speed in handwriting,^ two minutes 
was made the basis of the test, and this imit has quite generally 
been used in later tests. The practice is common, also, of 
reducing to the one-minute basis, thus making comparison easy. 

The speed measurement is secured by counting the letters 
in the pupil's copy and dividing by two. Although the 
pupils have been asked to count the number of letters, the 
teacher should carefully check the results. The teacher may 
reduce her work by knowing the total number of letters in the 
copy used, multiplying by the number of repetitions of the 
full copy, then adding the extra letters. Suppose a particular 
pupil has written the copy, " Mary had a little lamb," eight 
times, and has written the first three words the ninth time. 
The teacher in figuring the number of letters will multiply 18 
by 8 which gives her 144 and then add the number of letters 
in the three words, — " Mary had a," namely, eight. This 
gives a total of 152 letters. Dividing by 2 she gets the pupil's 
score, 76 letters per minute. In case the teacher gets a result 
different from the pupiPs result, the same should be placed 
in the lower right-hand corner, the pupil's figure being crossed 
out. This completes the scoring of the papers for speed. 

* Wilson, G. M., "The Handwriting of School Children," Elementary School 
Teacher y ii : pp. 540-543. This is the first known attempt to fix a standard 
for speed in handwriting. 
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Fig. I. — Ayres handwriting scale (pp. 28-35). The copy shown herewith 
is the so-called Gettysburg edition. 
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Scoring for Qtiality. — The teacher will be surprised how 
quickly she can learn to grade papers by using the Ayres 
scale. While it is helpful to have a demonstration and some 
practice in a teachers' meeting, this is not at all necessary, 
and the teacher who is patient and willing can train herself 
very quickly to use this scale and to secure satisfactory results. 
The teacher should give herself preliminary drill of at least an 
hour or two. If this drill is divided into half hour periods, 
and continued during a considerable part of a week, the 
teacher will become reasonably uniform in grading papers, 
and will feel competent to score the papers from the test in 
her room. At this point it would be well for her to consult an 
expert, in case one is available. This expert by a little observ- 
ing and advising will correct any marked defect, — such as a 
uniform tendency to grade too low or too high. In the absence, 
however, of a teacher, a supervisor, or a superintendent, in the 
system, who can give this expert help, a teacher need not be 
deterred. She can master the details, working entirely alone. 

Directions for grading a sample, while not uniform, have 
in mind the common object of helping the teacher to locate 
the specimen on the scale which most nearly corresponds in 
merit with the pupil's copy. Apparently the best way to do 
this is to glide the pupiPs copy back and forth underneath the 
scale, comparing it with one sample after another in the scale 
until a decision is reached as to which sample most nearly 
corresponds with the pupil's copy. The teacher will fre- 
quently have difl&culty, and especially where the pupil's copy 
is better, for example, than 50 on the Ayres scale, but not as 
good as 60. Some scorers recommend the use of intermediate 
units in such cases, permitting the teacher thus to indicate 
54, 56, or whatever the proper value may appear to be. 
Practice on this point varies. If the number of papers to be 
scored is not too large, intermediate values may be used. 

The score for quality when determined upon should be 
placed in the upper right-hand corner of the paper. 
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Recording the Scores. — From the beginning the teacher 
should acquire the habit of distributing her scores, showing 
both speed and quality on a single sheet. This will be found 
exceedingly helpful. Table 3, which follows herewith, shows 
such a distribution for a sixth grade. By reference to this, 
it will be seen that of the 33 pupils in the grade, 2 are writing 
at quality 20 (see totals at the bottom of the sheet), 4 at 
quality 30, 5 at quality 40, 8 at quality 50, 8 at quality 60, 
S at quality 70, and i at quality 80, The middle • score 
on the basis of quality will fall therefore in the group of 8 at 
50 and this is noted below as the median quality. 

Table 3. — Distkibution of Scores for a Sixth Grade 





.. 


.. 


4= 
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7= 


80 
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at- 30 
31- 40 
41- 50 
51- 60 
61- 70 
71-80 
81- 90 
91-100 

I 21-140 
141-160 
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181-200 








' 


■ 


\ 


^ 


3 

I 


^ 
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Totals for Qualiiy 
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5 


3,i 



Median Quality — 50 Median Speed — 56 

The totals for speed are indicated in the right-hand column. 
It is observed that the median speed falls between 51 and 60. 
In this particular case, however, the teacher has determined 



> See explanation o( middle 
33 papers, the middle 
either end. 



this c 



or median, p 261. Since there are 
e will be that of the 1 7th paper from 
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the exact median for speed, and it is recorded below as 56. 
To determine the exact median for speed all that is necessary 
is to arrange the papers in order, from lowest to highest on the 
basis of speed, then count in to the middle paper. In this 
particular case the middle paper would be the 17th one from 
either end, and it appears that the 17th one had a speed of 
56 letters per minute. 

Standard Scores. — With the scores fully tabled the 
teacher's next question naturally is, " How does the writii^ 
of my pupils compare with others, and what are the 
standards? " She wonders if sixth grade pupils should show 
a range in quality from 20 to 80, and if a median quality of 50 
is too low. In speed she notes that they are distributed from 
less than 30 to nearly 100. This means that some of the 
pupils are writing three times as rapidly as others. How 
rapidly should they write? So far as known this question 
was first raised only six years ago, and at that time a tentative 
standard for speed was indicated on the basis of results from a 
single city system. 

Table 4. — Standasds of Speed ' 
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B 


e 
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8 


1. Cleveland 

2. Kansas City (May, 1915) . 

3. Denver Survey . . . . 

4. South Bend (May) . . . 
S- Freeman's 56 cities . . . 

6. Brookline 

7. Newton 

8. Missouri Training Schools . 
g. Iowa, 33.569 children . . 




33 
31 

3<) 


53 
36 
48 
44 


64 

SO 

S' 
62 


60 

6g 

54 
77 
5Q 
76 
73 
80 
63 


70 
76 
63 
82 
63 
87 
85 
qa 
73 


76 

76.5 

66 

93 

68 

go 

<)4 

Q2 

75 


80 
69 

73 
98 

76 



Now, however, it is possible to indicate a standard based 
upon results obtained from all parts of the country, and to 

' Decimals largely omitted. 
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indicate rather definitely how well pupils in any particular 
grade should write. 

Table 4, given herewith, shows the median attainment in 
speed for Cleveland, Kansas City, Denver, South Bend, 
fifty-six cities combined, Brookline, Newton, the Missouri 
Training Schools, and over 33,000 Iowa children. 

From this table it will be seen that sixth grade children 
from different parts of the country are averaging from 63 
up to 92 letters per minute. It should be noted, however, that 
the 82 for South Bend is a May average and was secured by 
special attention after a test given earher in the year had 
shown the need for improvement. It is apparent, then, 
that the particular sixth grade shown in Table 3 is quite 
definitely below standard, if we take as a standard the per- 
formance of other sixth grade children throughout the country. 
In this connection, it may be well to note two proposed 
standards made by men who have given considerable thought 
and atten^on to the subject. 

Table 5. — Standards for Speed in Handwriting 



Grades 



Freeman 
Starch 



2 


8 


i 


5 


6 


7 


36 

31 


48 
38 


56 

47 


65 

57 


72 
65 


80 

75 



8 



90 
83 



Tables 4 and 5 will give plenty of opportunity for com- 
parison with actual performance and with proposed standards, 
to enable the teacher to judge of the writing in her own room. 
It appears that the median speed of 56 for her sixth grade is 
lower than the sixth grade median of any system appearing 
in Table 4, and indicates that the teacher should increase the 
speed of writing in this particular grade. She should at 
least aim to reach 63, the average of Freeman's 56 cities, the 
average also for Denver and the lowest sixth grade median 
appearing in Tables 4 or 5. 
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Standards for Quality. — In measuring quality for com- 
parative purposes it is necessary to use one of the standard 
scales of handwriting. Not all studies in the measurement 
of handwriting have made use of the Ayres scale, but Table 
6, given herewith, shows several returns in the Ayres scale 
and will permit comparison. 

Table 6. — Quality in Handwriting (Ayres) 



Gmadeb 
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» 
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Brookline 

Cleveland 

Denver 

Newton 

South Bend (May) .... 
Missouri Training Schools . 

Iowa median 

Freeman, 56 cities .... 


28 


45 
44 


i6 

49 

40 
47 


31 

49 

44 
50 


44 
45 

40 
41 
49 

53 


46 
48 
43 
51 
S3 
42 
53 
SO 


47 

50 
51 
io 
56 

4S 
57 
64 


49 

5S 
57 

S3 
54 
47 
61 
70 



It will be observed from this table that quality in hand- 
writing for the sixth grade has ranged from 42 in the Missouri 
Training Schools to 59 in the 56 cities reported by Freeman, 
It appears therefore that the particular sixth grade reported 
in Table 3, is writing better than the sixth grade pupils in the 
Missouri Training Schools, Brookline, Cleveland, and Denver, 
but not so well as those in South Bend, Newton, Iowa, or 
Freeman's 56 cities. 

The standards proposed by Freeman and Starch for quality 
are likewise given herewith : 

Table 7. — Standards of Qdality in Handwriting 



Grades 


. 


> 
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. 
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Freeman 

Starch 


44 

27 


47 
33 


50 
37 


55 
43 


59 
47 


64 

53 


70 
S7 
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It will be observed that the particular sixth grade writes 
better than the standard indicated by Starch, but not so well 
as the standard indicated by Freeman. 

Social Standard of Writing. — In attempting to set up 
standards, there is one danger which school people are likely 
to encounter, and that is the danger of considering writing as 
a school exercise, wholly apart from the social and business 
demands of life outside the school. In the last analysis it is 
this latter which should determine the proper standards. 
While it is difficult to get at the standards required by society, 
there are at least some evidences of social standards of hand- 
writing. Dr. Ayres has constructed a special handwriting 
scale for the Municipal Civil Service Commission of New 
York City. On the basis of this scale, the Commission con- 
siders that applicants pass in handwriting if they make 
a grade corresponding to quality 40 of the Ayres public school 
scale. Where handwriting is a special requirement a grade 
equal to quality 50 is required. These standards are lower 
than the Freeman standard for the sixth grade, and correspond 
fairly well with the Starch standard. However, sixth grade 
pupils will be in school two years longer, and under the present 
regime will write and continue to improve their writing for 
two years. This naturally raises the question as to whether 
the school standard for handwriting is not an artificial one, 
whereas it should be based directly upon the demands of 
society. 

There is additional evidence on this matter, as reported 
on page 24 of the First Iowa Elimination Report, as follows : 
** One himdred graduate students of Teachers College wrote at 
a median quality less than 50. Three hundred Indiana teachers 
in Perry, Green, and Ripley Counties wrote at median quali- 
ties less than 50. One hundred inquiries for help received by 
the Social Service Bureau of New York City showed a median 
quality less than 50. One hundred applications for positions 
ranging from $10 a week to $5000 a year, received by the 
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Social Service Bureau of New York City, showed a median 
quality of 60. Signatures on 100 bank checks showed a 
median quality of 41. 256 signatures on a hotel register 
showed a median quaUty of 41.1." It appears from the above 
that the adult social standard is fully satisfied by a quality of 
50 for practically all purposes. Even in the case of appli- 
cants for positions, where there is a special incentive for good 
writing, the median rises only to 60. On the basis of social 
usage, therefore, it appears that a quality of 60 on the Ayres 
scale should be accepted as satisfactory for any grade of 
school work, and that when pupils have attained a quality of 
60, with reasonable speed, they should be excused from further 
writing drill unless a pupil voluntarily chooses to continue. 
It will be observed from Table 6 that most 7th and 8th grade 
medians fall between 50 and 60. A quality of 60 therefore 
appears reasonable and attainable for upper grades. A 
higher standard except for special commercial positions would 
be artificial and unreasonable. 

What should be accepted as a reasonable speed from the 
standpoint of society has not been determined in any authori- 
tative manner. It is quite probable that a speed of 60 or 70 
letters per minute is sufficient to meet almost any situation. 
It would seem, therefore, that a teacher who brings her 
pupils to a quality of 60 and a speed of 60 has prepared them to 
meet the handwriting demands of society. Many pupils, 
because of special interests or superior abilities, will prefer 
to go above this, easily meeting the extreme social demands 
where handwriting of superior quaUty is required. 

Remedial Instruction. — When the sixth grade teacher has 
distributed her scores as shown in Table 3, and has decided 
what should be considered a reasonable standard in speed and 
quality for sixth grade pupils, her next question is how to 
remedy the situation for the pupils who are below standard 
in speed and quality. Studies have indicated that merely 
extending the time for the writing work will not solve the 
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problem. In fact, there is much evidence that children write 
too much and fall into careless habits for that reason. The 
story of how to remedy the defects is a long one, and will not 
be taken up fully in this discussion. The teacher is referred 
to other sources, particularly to the " Teaching of Hand- 
writing," by Frank N. Freeman. There are certain phases of 
the work of remedying defects, however, which have been 
subjected to definite measurement. 

Freeman has constructed a series of writing scales or charts, 
based upon the most common defects of the pupils' writing. 
These scales or charts deal respectively with — i, Uniformity 
of slant; 2, Uniformity of alignment; 3, QuaUty of line; 
4, Letter formation; 5, Spacing. Each chart contains 
three qualities of excellence, illustrating good, average; and 
poor qualities of handwriting from the standpoint of the 
characteristic dealt with in the particular chart. 

The teacher who is especially interested in writing, and 
especially the writing supervisor, will find it worth while to 
make use of Freeman's analytical charts. By carefully 
selecting samples of the pupils' writing she can for her own 
use make up charts similar to the Freeman charts, thus 
having available for showing to the pupils samples that 
illustrate desirable and undesirable features under uniformity 
of slant, uniformity of alignment, etc. 

Table 8, given herewith, should prove especially helpful, 
as it indicates the causes for the various defects. The teacher 
and pupil should work together in applying this table to the 
pupil's writing. If a pupil is writing with too much slant, 
the teacher will do well to study the pupil in the light of 
the five suggested causes. It may be a matter so simple as 
having the paper in the wrong position — and so with other 
defects. It is a matter of studying the situation with the 
particular pupil, analyzing the defect, finding the cause, and 
helping the pupil to apply the remedy. 
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Table 8. — Analysis of Defects in Writing and Their Causes^ 



Defect 

I. Too much slant . . 



2. Writing too straight 



3. Writing too heavy . 



4. Writing too light . 



5. Writing too angular 



6. Writing too irregular 



7. Spacing too wide . 



Causes 
(i) Writing arm too near body. 

(2) Thumb too stiflF. 

(3) Point of nib too far from fingers. 

(4) Paper in wrong position. 

(5) Stroke in wrong direction, 
(i) Arm too far from body. 

(2) Fingers too near nib. 

(3) Index finger alone guiding pen. 

(4) Incorrect position of paper. 

(i) Index finger pressing too heavily. 

(2) Using wrong pen. 

(3) Penholder too small diameter. 

(i) Pen held too obhquely or too straight. 

(2) Eyelet of pen turned side. 

(3) Penholder too large diameter, 
(i) Thumb too stiflF. 

(2) Penholder too lightly held. 

(3) Movement too slow. 

(i) Lack of freedom of movement. 

(2) Movement of hand too slow. 

(3) Pen gripping. 

(4) Incorrect or uncomfortable position, 
(i) Pen progresses too fast to right. 

(2) Too much lateral movement. 



The teacher may find it advisable to extend the list of 
defects, and this can doubtless best be done by making use of 
the analytical score card for handwriting, developed by Dr. C. 
Truman Gray of the University of Texas. It is indicated 
herewith. Figure 2. Dr. Gray's score card is in many respects 
more complete than the detail of defects listed by Dr. Free- 
man. 

The teacher will do well to enlist the pupil fully in the 
attempt to improve his writing. For the most part the pupil 

* F. N. Freeman's **The Teaching of Handwriting" in the Riverside 
Educational Monographs, page 72, published by Houghton, Mifflin Company. 
By special permission of the publishers. 
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- Standard Scobe Card foe Jodoing HAKDvnuTivc 
(Devised by C. Truman Gray) 



Pupa. . . 

Grade. . 
Teacher 



t 


Heaviness . . . 




Slant 




Uniformity . . 




Mined .... 


s 


Size 




Ilnifnmiily 




Too large . . . 




Too small . . . 


4 


Alignment . . . 


■? 


Spacing of Unes . 




Uniformity . . 




Too close . . . 




Too far apart 


f. 


Spacing of words . 




Uniformity . . 




Too close . . . 




Too far apart . 


7 


Spacing of letters . 




Uniformity . . 




Too dose . . . 




Too far apart . 


« 


Neatness .... 




Blotches . . . 




Carelessness . . 




Formation of letters 




General form 




Smoothness . . 




Letters not closed 




Parts omitted . 




Parts added . . 




Total score . 
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simply knows that his writing is poor. He doesn't know why 
it is poor, and he is given no help in applying proper remedies. 
If he realizes, for instance, that it is a question of slant, or of 
uniformity in spacing, or uniformity in height, or neatness, — 
that is, if he can be made to place his attention upon some 
particular defect and work toward the correction of that 
defect, he can feel that he is working toward some definite 
end and not merely drilling aimlessly upon writing. The 
teacher's business here is to teach, not to scold, not to find 
fault. The teacher may not find it advisable to use the Gray 
score card, so far as actually scoring the pupils' work is con- 
cerned, but she can use it along with Freeman's suggestions 
in discovering with the pupil the defects which need remedy- 
ing. In time the teacher may be able to construct a chart 
showing letter defects similar to Freeman's, but made up en- 
tirely from work of her own pupils. Freeman's chart ^ shows 
the correct form of a letter, together with the usual defects. 
It will help to furnish an answer to the pupil's " Why," 
when he asks why he was marked down in writing. All 
pupils appreciate being treated with consideration and given 
an opportunity of doing a reasonable amount of thinking in 
connection with their work. 

Locating the Individual. — The discussion under remedial 
instruction shows the necessity of locating the individual. 
It is suggested that the teacher be not satisfied with the 
distribution as indicated in Table 3, but go a step farther, 
placing in the names of the particular pupils, as in Table 9. 
This will individualize the work, and will also make it more 
intelligible to the children. Raising the score in quality for 
her room then becomes a question not of blind unintelligent 
drill, but a question of improving the work of John, Mary, 
Jane, William, etc. In fact, taking the particular sixth grade 
as an example, and accepting quaUty 60 as the standard, it is 
observed that 14 of the pupils are already writing satisfactorily. 

^ The Teaching of Handwriting, page 135. 
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From the standpoint of speed, 1 2 are writing above 60 and it is 
possible that some of the 8 writing between 51 and 60 are on 
a satisfactory basis. This analysis of the situation Umits 
the teacher's efforts to particular pupils, and enables her 
to apply her instruction where it is most needed. It also 
eliminates useless drill. At least two of the pupils writing 
at quaUty 60 or above are below in speed. These are Jeanette 
and Mark. Four others, Grace, Lily, Henry, and David, are 
also below in speed or just on the line. Four who are satis- 
factory in speed are below in quaUty. These are Bruce, 
Ruth, Bert, and Thomas. The eight to the right and below 
the heavy Unes are satisfactory in speed and quality, and 
further drill by them may be left to choice. If this plan 
were generally followed in school systems, a large amount 
of effort would be released in handwriting alone, for appUca- 
tion along other needed lines. 

Table 9. — Distribution of Scores for a Sixth Grade 
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21- 30 . . 


John 
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31- 40 . . 




Jane 


One 


Kate 




Mark 
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41- 50 . . 


William 


Luther 


Sarah 
Epsie 


Carrie 
Hazel 


Jeanette 
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$1- 60 . . 




Wilber 


Bertha 


Joe 
Paul 


Grace 

Lily 

Henry 


David 
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61- 70 . . 






Bruce 


Ruth 


Eldon 


Bess 
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Bert 


Ina 


Frank 








71-80 . . 








Thomas 


Mildred 




Doris 
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81- 90 . . 












Helen 






I 


91-100 . . 










Jacob 
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I0I-I20 . . 
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Proportion of Children at Standard Quality. — Figure 3, 
given herewith, shows a distribution of upper grade pupils 
in Cleveland, Ohio. Computation shows that 3303 of the 



2700 




Fig. 3. — Number of pupils writing at each quality from 20 to 90. 
Data from 10,528 pupils in four upper grades (Cleveland Survey, p. 70, 
" Measuring the Work of the Public Schools ")• 31.3 % at 60 or above. 

children, or a total of 31.3%, were writing at quaUty 60 or 
above. The Springfield, Illinois, survey showed that Z2>'Z% ^f 
the upper elementary grade pupils were writing at 60 or above. 
In the Butte, Montana, survey 23.8% of the pupils in grades 
2 to 8 were writing at quality 60 or above. In Kansas City, 
in 1915, 16.4% of all pupils were writing at quality 60 or 
above. In the three upper grades in Kansas City these 
percentages were as follows: 



Fifth grade 
Sixth grade 
Seventh grade 



25.1% at quality 60 or above. 
39.7% at quaUty 60 or above. 
48.4% at quality 60 or above. 
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This means that in the seventh grade in the Kansas City 
schools, practically half of the children were writing at a 
satisfactory standard of quality, and should have been excused 
from further drill. 

These figures taken from city reports and surveys make it 
evident that many upper grade pupils should properly be 
excused from further writing drill, and that our illustrative 
sixth grade throughout this chapter is quite representative in 
its distribution of writing abiUty in an intermediate or upper 
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47 


50 


60 


70 


81 


93 



Fig. 4. — Speed records of 36 sixth grades, Cleveland. 



grade. The procedure recommended for this grade should be 
applied quite generally. Some teachers, however, may 
want to require pupils to reach and maintain in all written 
work a quality somewhere above 60, even as high as 70, 
before excusing them from further drill in writing. Some 
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pupils who are excused from drill may prefer to continue 
until a higher standard is reached. 

The Writing of an Entire School System. — Above in Figure 
3, the writing scores for an entire school system above 
the third grade are thrown together into a single distribution. 
There are various ways in which these data for a school sys- 
tem may be used to advantage, the following being particu- 
larly useful : 

I. A grade in one building or part of the city may be 
compared with the same grade in other buildings or parts of 
the city. Figures 4 and 5 show this detail in median 
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Fig. 5. — Quality records of 36 sixth grades, Cleveland. 



speed and quality for the sixth grade of the Cleveland 
schools. Any teacher may locate her particular grade in 
these distributions, and so see its rank in terms of the median 
scores. 

2. A city may be compared with another city or with an 
established norm or standard. Table 10 will aid in this work. 
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It is valuable as a means of showing quickly and forcefully 
the relative standing of the city in question. Here Cleve- 
land is compared with 12 other cities in speed and quality, 
as follows : 

Table 10. — Speed and Quality in Handwriting, Cleveland, 

Ohio 





Average Speed 


12 other cities . . 
Cleveland . . . 


5th grade 

57 
62 


6th grade 

6s 
69 


7th grade 

75 
73 


8th grade 

83 
78 




Average QuALrry 


12 other cities . . 
Cleveland . . . 


5th grade 

43 
45 


6th grade 

47 
48 


7th grade 

53 
50 


8th grade 

57 
55 



The Freeman standards have been much used for such 
comparisons. 

3. After building scores and medians have been ascertained, 
a superintendent of a school system may desire a total city 
summary. Table 11, following herewith, permits comparison 
of the writing in any particular building with writing in the 
other buildings and the city average. This will be particularly 
interesting and stimulating to teachers, principals, and super- 
visors. 

In speed, Cleveland excels the other cities in grades five 
and six, but is below in grades seven and eight. However, 
there is some evidence that Cleveland is more nearly right 
in the matter of speed in writing than is the average of the 
twelve other cities. Likewise in quality, grades five and six 
of the Cleveland schools do better than the average of the 
twelve other cities, but grades seven and eight are below the 
average of the twelve cities. 
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Emerson School i 9.6 
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10.9 
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Forest School . . 
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Grant School . . 
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Hamilton School . 








II.9 


; 10. 1 


II. 5 


12.9 


12.5 




Jackson School 








10.7 


10.7 


9-9 


10.5 


11.4 


13. 


JeflFerson School . 










, 9.5 


II-3 


11.5 


11.3 


11.6 


Lafayette School . 








1 10.5 


: "3 


10.6 


10.3 


12.2 


14.7 


Lincoln School 








9.0 


i 9.2 


9.0 


II. 


II. 2 




Lowell School . . 








8.6 


i 10.6 


II. 7 


11.8 


14. 


14.6 


Onequa School 








10.5 


i II.6 


10.9 


9.9 


12.2 


13.5 


Oquirrh School . . 








8.7 


i IO-7 


12.2 


13.3 


12. 1 




Poplar Grove School 










9.5 


9.8 


11.3 


11.6 


12.4 


Riverside School . . 








9-4 


12.7 


9.8 


II. 


12. 


12.2 


Summer School . . 








10.2 


13.8 


12.4 


12.2 


12.7 


13.9 


Training School . . 








7.1 


9.0 


9.8 


9.6 


11.6 


12.5 


Wasatch School . . 










12.7 


13.4 


11.3 


12.4 


12.3 


Washington School . 








8.9 


9.7 


9.5 


10.7 


II. 2 




Webster School . . 








7.6 


II. I 


10.7 


12. 1 


12.8 


11.6 


Whittier School . . 








9.1 


11.7 


11.4 


12.0 


12.8 


14.7 


For the City . . . 


9.2 


10.7 


II.O 


11.3 


12.2 


12.8 



Table 12 shows a total distribution of quality in writing 
for Salt Lake City. It will be observed that this table gives 
a different view from the distribution of medians shown for 
Cleveland in Table 6. It gives a worth while bird's-eye 
view of the writing for the entire city. This distribution is 
particularly valuable to the superintendent and supervisors in 
showing the work yet needed on handwriting. Quality 12 
of the Thomdike scale corresponds to 60 of the Ayxes scale. 



^ The scores in Tables 11 and 12 are in terms of the Thomdike scale to be 
explained further on in the chapter. 
2 Salt Lake City Survey, page 148. 
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Table 12. — The Distribution of Scores in Quality on 3685 
Samples of Penmanship by Grades. (Salt Lake City) * 



Score 



4 

5 

6 

7 

8 

9 

10 

II 

12 

13 

14 

15 

16 

17 

18 

Number of samples . , 

Median score for grade 



hi 


IV 


V 


VI 


VII 


3 










4 










21 


5 




3 




55 


30 


3 


3 


2 


85 


63 


59 


26 


8 


196 


175 


147 


117 


70 


46 


37 


23 


38 


12 


102 


152 


190 


53 


163 


44 


60 


65 


92 


91 


39 


lOI 


98 


87 


189 


II 


38 


41 


52 


68 


4 


12 


15 


20 


31 


4 


9 


4 


10 


24 




4 


I 




2 




I 




I 


2 


616 


687 


646 


602 


662 


9.2 


10.7 


II.O 


11.3 


12.2 



vin 



28 
4 

97 
81 

84 
50 

35 
61 
10 

22 



472 



12.8 



Pupils writing above quality 12 should be excused from further 
drill, except voluntary drill. 

The Thomdike Scale. — While it is assumed that the 
teacher will doubtless use the Ayres scale, because of its 
convenience and availability, yet teachers should know of the 
Thorndike Scale, and should appreciate the fact that it was 
Dr. E. L. Thomdike who first gave us a usable scale for hand- 
writing. 

The Thomdike scale is based upon general merit, as 
determined by the judgment of a large number of competent 
judges. In this respect it differs from the Ayres scale, which 
is based entirely upon legibility. It is unnecessary at this 
point to go into the discussion of the merits of the two scales. 

^ Salt Lake City Survey, page 149. 
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It is agreed that either scale can be understood, and will give 
much better results than the old method of grading. Because 
the Thomdike scale was first developed, and its value was 
immediately appreciated by school men, it was introduced 
into a large number of school systems, and is still retained in 
many of them. For this reason it will be well to indicate 
standards of quaUty according to the Thomdike scale. 
The numbers are quite definite since the samples on the 
Thomdike scale range from 4 to i8. The Thorndike scale 
was used in the Butte survey, and Table 13 shows the 
complete distribution of scores in quaUty. 

Table 13. — The Distribution of Scores in Penmanship (Butte 

Survey, p. 165) 



Score (Quality) 



Grade 



O 

I 

2 

3 

4 

5 

6 

7 

8 

9 

10 

II 

12 

13 

14 

15 

16 

17 

18 

Total papers . 
Median scores 



3 



5 
22 

21 

29 

28 

42 

7 
29 

5 

7 



196 



2 

2 

21 

44 
86 

41 
8 

13 
2 

2 



8.2 



221 



8.0 



3 
16 

24 
42 

55 
20 

21 

15 
2 

3 
I 



202 



3 

3 
12 

56 
61 
16 

17 

15 
6 

4 



6 



8.8 



194 



8.9 



2 

I 
20 

25 
9 

32 
44 

17 

10 

9 
10 

6 

3 



3 

15 

29 
II 

25 
12 

19 
16 

6 

12 

2 

I 



8 



I 

3 

7 

15 

I 

23 
21 

9 

9 

15 

17 

3 



188 



11.6 



152 



II. 2 



124 



12. 1 
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Table 14 shows the median performance in certain cities 
and indicates also the Freeman standard, expressed in imits 
of the Thorndike scale. 

Table 14. — Quality of Handwriting (Thorndike) 





I 


II 


III 


IV 


V 


VI 


VII 


VIM 


Connersville, Indiana 


10.3 


lO.O 


10.3 


II.7 


II.7 


II 


Butte, Montana . . 




8.2 


8.0 


8.8 


8.9 


II.6 


II. 2 


12. 1 


Salt Lake City . . 






9.2 


10.7 


II. I 


11.3 


12.2 


12.8 


Kansas City . . . 


7.2 


7.4 


8.4 


9.3 


10.4 


II.O 


II.4 




Freeman's 56 cities ^ . 




7.8 


9.4 


10.2 


II. 2 


11.6 


12.5 


13.4 


Freeman's standard ^ 




8.4 


9.8 


10.9 


12.0 


12.8 


13-9 


15.2 



Table 15, which follows, contains a complete table of trans- 
formation, by which the qualities in the Ayres scale may be 
transformed into the Thorndike scale and vice versa. 



Table 15. — Comparative Values.^ 



Aye&s 


Thorndike 


Thorndtkf. 


Ay&es 


20 


6.33 


5 


9.5 


30 


7.60 


6 


17.4 


40 


8.86 


7 


25.3 


50 


10.13 


8 


32.2 


60 


11.39 


9 


41. 1 


70 


12.66 


10 


49. 


80 


13.93 


II 


56.9 


QO 


15.19 


12 


64.8 






13 


72.7 






14 


80.6 






15 


88.5 






16 


96.4 



* Transformed scores, approximate only. 

^ Dr. T. L. Kelley, Journal of Educational Psychology, December, 1914. 
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Lister-Meyers Handwriting Scales. — These scales are in 
use in the schools of Greater New York. They were pre- 
pared by Professors Lister and Meyers of the Brooklsm 
Training School for Teachers. They are printed on a sheet 
24''X26'' and show rankings from 90 to 20 on the three 
items : form, movement, and spacing. This scale is a good 
illustration of a special adaptation based upon the type of 
writing which the supervisors are endeavoring to secure in 
the particular city. 

The teacher who is merely interested in putting her grading 
system on a scientific basis may neglect some of the present 
discussion and may secure good results by simply following 
the rules laid down for giving the tests, scoring the results, 
distributing the scores, and applying remedial instruction. 
What now seems theoretical and abstract in the measure- 
ment of handwriting will take on new significance as the 
teacher gradually masters the details of applying the work 
to her own schoolrooms. The practice will illuminate the 
theory ; that which is theoretical will become practical. The 
work is of value as it modifies and improves school practice. 
Many teachers, however, will desire to know the history and 
development of the work, and in addition to a thorough 
study of the present chapter, will use the following bibli- 
ography to further study the subject. 
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CHAPTER IV 

THE MEASUREMENT OF ARITHMETIC 

Measurement in arithmetic is not so simple as in spelling 
or in handwriting. Arithmetic taken as a whole involves 
many processes, and each process in turn involves particular 
difficulties. No one has attempted to measure general 
mathematical ability,^ and no one has attempted a test which 
covers the entire range of arithmetical processes. However, 
in its more essential and mechanical phases, arithmetic is 
susceptible of quite definite measurement. Since our purpose 
is to help the teacher in measuring her work to the extent that 
scales and standardized tests are available, it will be necessary 
to consider what can be measured at the present time, and 
what are the means available for doing it. 

There are at the present time five series of tests reasonably 
well standardized. The one first developed and most exten- 
sively in use is the Courtis Standard Research Tests. At the 
present time Courtis is confining his work to measurement 
of the four fundamental processes by his tests known as 
" Series B." 

The Courtis tests were first made available during the 
school year of 1909-10. The first tests, known as " Series 
A," were tentative in nature, and have since been discontinued. 
The use of Series B has extended very rapidly. During the 

* The discovery of mathematical ability among secondary pupils has re- 
cently been attempted and a prognostic test devised. See Rogers, Agnes Low : 
"Experimental Tests of Mathematical Ability and Their Prognostic Value." 
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year 191 5-1 6 they were used in forty- two states of the Union, 
Hawaii, and two foreign countries, a total of one half million 
copies being sold. The teacher will see, therefore, that in 
making use of the Courtis tests she is becoming acquainted 
with a method of measurement which is used widely and is 
likely to be used even more extensively in the future, unless 
they are replaced by something better. The chief attention 
of the teacher will be directed to these tests. 

Courtis Arithmetic Tests, Series B. — Series B consists 
of tests in addition, subtraction, multiplication, and divi- 
sion, respectively. The test in addition consists of twenty- 
four examples, each made up of nine three-place numbers. 
They are constructed mechanically in such a way that 
each example is equal in difficulty to every other. The 
addition problems, therefore, consist of 24 different units of 
measurement, more or less in the nature of 24 foot rules, 
although not as accurately equal one to the other. The 
point of the test is to see how many of these examples can be 
solved by a pupil in a given time, — the time in this case 
being eight minutes. The examples in subtraction, multi- 
plication, and division are likewise made up on the basis of 
uniform difficulty and a definite time limit is set. The details, 
which follow herewith, give a few samples from each of the 
tests, together with the time limits. 



Arithmetic. Test No. i. Addition score 

No. Attempted- - 




Series B Form 2 ^.t «• 1.* 

No. Right-. 



You will be given eight minutes to find the answers to as many 
of these addition examples as possible. Write the answers on this 
paper directly underneath the examples. You are not expected 
to be able to do them all. You will be marked for both speed and 
accuracy, but it is more important to have your answers right than 
to try a great many examples. 
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127 


996 


237 


386 


186 


474 


877 


637 


376 


320 


949 


463 


776 


787 


846 


686 


963 


778 


486 


827 


684 


691 


981 


462 


333 


886 


987 


240 


260 


106 


693 


904 


326 


913 


364 


616 


372 


869 


184 


611 


911 


164 


600 


261 


846 


461 


772 


988 


664 


897 . 


744 


766 


696 


336 


749 


669 


167 


972 


196 


833 


264 


820 


266 


127 


664 


119 


234 


969 


137 


633 


268 


323 



Test No. 2. Subtraction 

The test consists of 24 examples like the following, time 4 
minutes. 



97089301 
20203267 



93994413 
64783938 



108061861 
73463849 



163130669 
91061266 



168364186 
70637861 



188646364 
92471269 



120981427 
64188046 



106766782 
90863147 



Test No. 3. Multiplication 

The test consists of 25 examples like the following, time 6 
minutes. 



6283 
47 



9624 
603 



7863 
36 



4926 
620 



6873 
49 



2964 
94 



8367 
87 



6249 
78 



3786 
36 



4966 
19 



Test No. 4. Division 

The test consists of 24 examples like the following, time 8 
minutes. 



29)24679 



67)61642 



38)32300 



64)61604 



46)34086 



76)66600 



92)27784 



83)26643 



The Measurement of Arithmetic 6i 

Nature of the Examples. — The teacher may properly 
question whether the examples appearing in these tests are 
such as commonly appear in ordinary business transactions. 
That the examples of the Courtis test are more or less artificial 
from a social standpoint, and considerably more difficult than 
the transactions actually occurring under business conditions, 
is borne out by a study of the social and business use of arith- 
metic reported in Chapter 8 of the Sixteenth Yearbook of 
the National Society for the Study of Education.^ The 
teacher should remember, however, that the purpose of the 
Courtis examples is merely to test the ability of the pupils 
in the fundamentals. Examples as difficult as those appearing 
in Test i, for instance, will involve all of the difficulties of 
simpler examples, and so in a measure justify themselves in 
that they test the extreme ability likely to be required not 
only of pupils in the public schools under any system, but 
even by the exigencies of business and social situations of 
adult life. Mr. Courtis himself has recognized the fact that 
the multiplication and division problems are entirely too 
difficult for third grade pupils, and as a result his 1916 
standards indicate a zero score for third grade pupils in 
multiplication and division. It is possible that in time 
further adjustment will be made in this same direction. 

Directions for Giving the Tests. — It is assumed that the 
teacher is not interested in a merely theoretical discussion of 
arithmetic tests, but in using them for the measurement of 
the work in her own schoolroom. The next step, therefore, 
in connection with the Courtis tests is to write ^ for a sufficient 
quantity of the research tests in arithmetic, series B, to enable 
her to give the test to the number of pupils which she has in 
her room. 

^ This study has since been confinned by a larger study. Wilson, G. M. : 
"A Survey of the Social and Business Usage of Arithmetic," Teachers 
College Bureau of Publications. 

* See bibliography for directions. 
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The teacher will have no difficulty in administering the 
tests. One or two of the tests can be given during a single 
recitation period. The time for addition is 8 minutes, for 
subtraction 4 minutes. It will doubtless be better to give 
these two tests on one day, deferring the tests in multiplica- 
tion and division until the next day. The instructions follow 
herewith. 

Instructions to Examiners 

1. For each room, prepare as many bimdles of papers as there 
are rows of seats, putting into each bundle as many papers as there 
are seats in each row. 

2. Begin by saying, "My purpose this morning is to measure 
how well this school teaches its children how to add, subtract, 
multiply, and divide. I have here some printed tests. They are 
not examinations, because exactly these same tests are given to 
all the grades from the third through high school. They are also 
being given in other schools in this city, and in other cities all over 
the coimtry. It is the school that is being examined to-day. If 
you treat the tests as though they were a game, you will enjoy 
them and do your best for the honor of your school. I am going 
to give each of you a set of these papers, but do not look at them 
until I tell you to do so. Will the boys and girls in the front seats 
please distribute them for me?*' 

3. Distribute the papers by putting a bundle on the first desk 
in each row and letting the children do the rest. 

4. Have the children fill out the blanks at the top of the first 
page. Write the date in figures, and the time to the nearest half 
hour; thus: 9-25-1913-10:30. 

5. Have the children read instructions for Test i aloud in 
concert. 

6. "Now please hsten closely. In these tests it is important 
that we all start at the same time and stop at the same time. We 
can do this easily, if you follow my instructions exactly. Lay 
your papers on your desks in position to work the examples, but 
dose the cover with your left hand, keeping it between your thumb 
and finger, like this (illustrate), so that you can open it quickly 
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when I tell you to start. Take your pencil in your right hand, 
and when I say *Get ready/ raise your pencil hand in the air as 
if you were going to ask a question. (Illustrate, by suiting the 
action to the words.) Then when I say * Start/ you can bring 
your pencil down as you turn the cover back, and every one will 
start at the same time. When I say * Stop/ I want you all to stop 
at once, and to raise your hands again so that I can see that you 
have stopped. Now I think we are ready to try the test." 

When the second hand of the watch reaches the S5-second mark 
say " Get ready for the addition test. Hands up." Exactly at 
the 60 mark say "Start." 

Allow Exactly Eight Minutes 

"Stop. Hands up." Make sure all have stopped. "Count 
how many examples you have finished, and write the number in 
the score card in the comer under the number attempted. Do 
not count examples you have begun but have not finished. Your 
score is the number of the examples you have finished. I am 
coming to your desk to see that you have written it in the right 
place." 

7. Read the answers from an Answer Card (be sure the form 
number corresponds with that of the tests), and have the children 
check answers right or wrong, counting the number right, and 
writing it in their score cards. 

8. In similar fashion give and score the other tests. 
For Test 2, Subtraction, allow exactly FOUR minutes. 
For Test 3, Multiplication, allow exactly SIX minutes. 
For Test 4, Division, allow exactly EIGHT minutes. 

9. Give Tests i and 2 the first day, and Tests 3 and 4 the next. 
All may be given at one time if desired. 

Scoring the Results. — The teacher may save herself 
much work, and on the whole secure equally satisfactory 
returns if she has each child score his own paper at the close 
of the tests (or papers may be exchanged for scoring). This 
method has the added advantage of enlisting the interest of 
the children. 
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Ask the child to count the number of examples for which 
he has a complete answer. This number is to be placed in the 
little square at the upper right-hand corner of the test paper, 
in the blank following " Number Attempted." The next 
thing to do is to read from the key, furnished with the tests, 
the correct answer to the various examples. Each pupil 
should follow, crossing out on his sheet any answer which is 
not correct. Have pupils then count the number of correct 
answers, and place the result in the upper right-hand corner, 
in the blank following " Number Right." If the pupil is 
made to understand the purpose of the work and the necessity 
of knowing exactly the condition of the class, and for that 
matter his own condition, there will be no difficulty in getting 
full cooperation on the part of the pupils and honesty from 
every member of the class. 

At the conclusion of the test the pupil should make his 
own graph, showing attempts and rights in the four funda- 
mental processes, as per the example which follows. 




Research Tests in Arithmetic 

INDIVIDUAL score SHEET 



ARITHMETIC 
Series B 



Name..— -M?:.'?'!^.— .., ..%., Age last birthday... {f. 

boy or girl 

School fh^A^.: Grade .^.......RoonL..J.... 

State Date...?.%^..'.f. 



City 







INDIVIDUAL scores 




CLASS 


SCORES 




Attempts 


Rights 


Attempts 


Rights 


Test 

No. I 
No. 2 
No. 3 

No. 4 


SXJBJECT ISt 

Trial 
Addition 
Subtraction 
Multiplication 
Division 


2nd 
Trial 


Change 


ISt 

Trial 


2nd 
Trial 


Change 


ISt 

Trial 


2nd 
Trial 


ISt 

Trial 


2nd 
Trial 
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6s 



GRAPH 



Addition 


Subtraction 


Multiplication 


Division 


Attempts 


Rights 


Attempts 


Rights 


Attempts 


Rights 

24 


Attempts 

24 


Rights 


24 


24 


24 


24 


24 


24 


23 


23 


23 


23 


23 


23 


23 


23 


22 


22 


22 


22 


22 


22 


22 


22 


21 


21 


21 


21 


21 


21 


21 


21 


20 


20 


20 


20 


20 


20 


20 


20 


19 


19 


19 


19 


19 


19 


19 


19 


18 


18 


18 


18 


18 


18 


18 


18 


17 


17 


17 


17 


17 


17 


17 


17 


16 


16 


16 


16 


16 


16 


16 


16 


16 


15 


15 


15 


15 


15 


15 


15 


14 


14 


14 


14 


14 


14 


14 


14 


13 


13 




i-ji^w 


13 
\^12 


13 
12 


IS 
,12-... 


13 
12 


12 


-^3v 

12 ^ 


1^ 




11 


11 


....11.., 


11 


\ll— 


^11— 


^11^ 


Vii 


10 


10 / 


10 \ 


10 


10 


10 


/ 10 


Mo 


K 


^ / 


9 


*.» y •.•••••••••••••Q»»»^ 


• «/ 


• 

9 


9 


8 ••-..., 


. 8/ 


8 


8 


8 


•^...«.- 


8 


8 


7 


••7- 


7 


7 


7 


7 


7 


7 


6 


6 


6 


6 


6 


6 


6 


6 


5 


5 


5 


5 


5 


5 


5 


5 


4 


4 


4 


4 


4 


4 


4 


4 


3 


3 


3 


3 


3 


3 


3 


3 


2 


2 


2 


2 


2 


2 


2 


2 


1 


1 


1 


1 


1 


1 


1 


1 



























Instructions, In each column mark the number that corre- 
sponds to your score for that column. Then with a ruler draw 
a line from each number so marked to the next. Draw a curve 
for the class scores in the same way, using a dotted line. By 
comparing the two curves you can tell how much your scores are 
above or below the class results. 

This individual score sheet will appeal to children, and 
will be exceedingly serviceable in securing the necessary further 
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progress of the children. The teacher may find it worth 
while to arrange all of the score sheets in order of excellence 
by p inni ng them on a piece of burlap on the side of the room. 
The pupil's score (dotted line) appearing on the individual 
score sheet above is the record of an eighth grade pupil who 
is about average in ability. 

In order that the teacher may get an inteUigent view of the 
performance of her class, it will be necessary to make a 
distribution of the class scores, and it is suggested that this 
be made in such a way as to show both speed and accuracy. 
Table i6, which follows herewith, shows such a distribution for 
35 eighth grade pupils. 

Table i6. — Showing Distribution or 35 Eighth Gkade Pupils in 

Speed and Accuracy in September. Addition, Series B 

Score in Examples Attempted. (Speed) 
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The table is to be read as follows: Out of 35 eighth 
grade pupils taking the addition test, one had a score of 
7 attempts and 4 rights, one had a score of 8 attempts and 
4 rights, etc. 

It will be observed by reading the totals at the bottom 
of the sheet that one pupil attempted seven examples, two 
attempted eight, etc. One pupil in the class attempted as 
many as sixteen examples. The median score for attempts 
is II. 

The next question is, what proportion of the examples 
attempted were solved correctly. Totals for this may be 
read from the right-hand column. It will be observed that 
the one pupil who attempted seven problems solved four of 
them correctly. Of the two pupils who attemted eight, one 
solved four correctly, the other five. The one pupil who at- 
tempted 16 problems solved twelve of them correctly. The 
median performance of the group was seven problems solved 
correctly, and this is 63.6% of the median for attempts. 
It is quite evident from a study of the distribution that there 
is a wide range of ability in the class. The two pupils who 
solved twelve problems correctly certainly did three times as 
well as the two pupils who solved four problems correctly. 
Naturally the next question is, what is a reasonable standard 
for this particular eighth grade, and likewise for each of the 
grades. 

Standard Scores. — The question is best answered by 
indicating the standards set up by Courtis, and by noting 
the performance of children throughout the country. The 
standards set up by Courtis are based upon an experience of 
five years and the scoring of thousands of test papers. In 
general, Courtis has placed his standards slightly above the 
median performance of children throughout the United 
States. The standards are indicated in Table 17, which 
follows herewith, and the median performances are shown in 
Table 18. 
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Table 17. — Courtis Standard Scx)res, for May, Arithmetic Tests, 

Series B 





Addition 


SuBxaAcnoN 


Multiplication 


Division 


Gkaoe 
























191S ' 1916 

1 


191S 


19x6 


191S 


1916 


191 s 


19x6 


III 


1 
3 ' 4 


4 


5 


3 





2 





IV 


5 


6 


6 


7 


5 


6 


4 


4 


V 


7 


8 


8 


9 


7 


8 


6 


6 


VI 


9 


10 


10 


II 


9 


9 


8 


8 


vn 


II 


II 


II 


12 


10 


10 


10 


10 


VIII 


12 


12 


12 


13 


II 


II 


II 


II 



Standard of Accuracy 100 % 

The standards are given in Table 17, for the year 191 5 as 
well as for 1916. The fact that after four or five years of 
work Courtis is still changing his standards tends to give the 
teacher a feeling of security in case her own results vary con- 
siderably from the standards. In fact, the individual teacher 
should not worry greatly about the standards, although they 
will be found valuable, and an attempt should be made to 
reach them. The work most helpful to the teacher, however, 
is the distribution of the scores for her own children, and the 
next section will discuss the significance of such distribution 
for a class. 

The teacher who is interested in knowing what the per- 
formance of children elsewhere has been, is directed to Table 
18. This gives the median performance of thousands of 
children for grades 4 to 8 inclusive throughout the country. 
Under each grade the first line gives the median as deter- 
mined by Courtis in 1916, and on which he has based his 1916 
standards. The second line under each grade gives the 
same data for 191 5. The next four lines under each grade 
give the 191 5 performance as reported from Indiana, Iowa, 
Kansas, and Boston, respectively. It is suggested that the 
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teacher study this table with particular reference to the grade 
or grades included in her own room. Is a speed of 11 and 
an accuracy of 63.6%, as shown for the eighth grade in Table 
16, up to the average of children throughout the country? 
How much better is it than the average eighth grade child 
did in Indiana in 191 5 ? How much below the Boston median 
performance? The answer to these questions will interest 
the children fully as much as they interest the teacher. 

With the above questions in mind, and with the standards 
as given in Table 17, and the median performances as given 
in Table 18 before her, the teacher may now return to the 
distribution of grades for the 35 eighth grade pupils shown in 
Table 16. The median attempts for the 35 pupils are 1 1 . The 
Courtis standard in addition for 191 5 and 191 6 is 12. It 
appears, therefore, that the median performance of the class 
is one below standard, so far ^s speed is concerned. It will 
be observed, however, that Boston alone, as shown in Table 18, 
equals or exceeds the Courtis standard. Courtis's own sum- 
mary for performance in 191 6 shows 10.2 as the median for the 
eighth grade (Table 18). It appears, therefore, that this 
particular eighth grade has done a little better than the eighth 
grade pupils reported on by Courtis in 1915 or 1916, and also 
a little better than the eighth grade pupils included in the 
summaries for Indiana, Iowa, or Kansas. In view of this 
comparison, the teacher may feel that her grade is on a 
reasonably satisfactory basis so far as speed is concerned. 
Turning now to accuracy, it is observed that this particular 
eighth grade class has an accuracy of 63.6%. Courtis calls 
for 100%. This, however, was not attained by any of the five 
groups included in Table 18. In addition the Boston eighth 
grade did be^t with an accuracy of 78%. It will be observed, 
however, that no one of the five groups goes quite as low as 
this particular eighth grade. This fact and the scattered 
distribution noted above emphasize the importance of giving 
attention to the addition work of this particular eighth grade. 
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Table i8. — Median Performances for May, Courtis Arithmetic 

Tests, Series B 







1916 


Addition 


Subtraction 


MULTIPLICATIQN 


Division 


Grade 


Speed 


Accu- 
racy 

% 
60 


Speed 


Accu- 
racy 

% 
76 


Speed 


Accu- 
racy 

% 

68 


Speed 


Accu- 
X9xy 

% 




Courtis . 


6.7 


7.3 


6.3 


4.5 


59 




Courtis . 


1915 


5.9 


40 


6.2 


52 


5.0 


50 


3.6 


42 


IV. 


Indiana . 
Iowa . . 


1915 
1915 


6.9 


58 


7.4 


73 


6.3 


66 


4.8 


63 




Kansas . 


1915 


5.9 


51 


6.4 


64 


5.2 


58 


3.8 


56 




Boston . 


1915 


8.0 


67 


7.6 


82 


6.2 


67 


4.8 


6S 




Courtis . 


1916 


7.8 


68 


8.6 


82 


7-5 


76 


5.7 


77 




Courtis . 


1915 


6.3 


58 


7.8 


73 


6.8 


68 


5.4 


65 


V. 


Indiana . 
Iowa . . 


1915 
1915 


7.2 
8.2 


59 
64 


7.5 
9.0 


71 
78 


6.0 
7.6 


61 
74 


5.0 
6.2 


65 
81 




Kansas . 


1915 


7.0 


61 


7.9 


75 


7.0 


69 


4.9 


68 




Boston 


1915 


9.4 


71 


9.3 


85 


7.7 


76 


6.5 


81 




Courtis . 


1916 


8.9 


71 


9.7 


84 


8.8 


79 


7.8 


87 




Courtis . 


1915 


8.4 


67 


9.2 


82 


7.9 


76 


7.1 


83 


VI. 


Indiana . 
Iowa . . 


1915 
1915 


8.3 

8.8 


64 

75 


8.7 
9.9 


77 
89 


7.5 


68 

75 


6.1 
7.6 


79 
84 




Kansas . 


1915 


8.1 


65 


9.1 


81 


8.1 


77 


6.5 


84 




Boston . 


1915 


II. I 


75 


II. I 


87 


9.4 


79 


8.7 


87 




Courtis . 


1916 


9.8 


72 


II. 2 


85 


lO.O 


80 


9.6 


91 




Courtis . 


1915 


9.2 


67 


10.6 


83 


9.0 


78 


8.1 


83 


vn. 


Indiana . 
Iowa . . 


1915 
1915 


8.9 
9-4 


64 
70 


9.9 

II.O 


80 
83 


8.5 

10.4 


71 
79 


7.8 
9.0 


84 
89 




Kansas . 


1915 


8.7 


67 


lO.O 


83 


9.0 


78 


9.3 


87 




Boston . 


1915 


12.3 


76 


12.2 


87 


IO-5 


81 


10.2 


90 




Courtis . 


1916 


10.2 


74 


12. 1 


85 


II.O 


81 


10.4 


93 




Courtis . 


1915 


10.2 


67 


12.3 


81 


10.6 


82 


10.6 


91 


vm. 


Indiana . 
Iowa . . 


191S 
1915 


9.5 
10.3 


67 
72 


10.9 
12.9 


82 
86 


9.9 
11.6 


74 
83 


9.7 
12.0 


87 
93 




Kansas . 


1915 


9.8 


71 


"•5 


86 


10.9 


82 


10.9 


92 




.Boston . 


1915 


13.7 


78 


13.6 


89 


11.6 


83 


12.2 


92 
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Remedial Instruction. — In discussing remedial instruction 
it will be well to recur again to the distribution of the eighth 
grade pupils shown in Table 16. The fact that no pupil 
reaches 100% accuracy indicates that the addition com- 
binations are not fully mastered by any member of the class, 
that the addition problems in the test are too difficult for the 
mental attainments of the group, or that pupils are careless 
in their procedure. There may be other explanations of in- 
accuracy. Let us assume, however, that the first reason is the 
one which obtains with this particular grade, and it is quite 
likely the correct reason for most members of the class, since 
we may assume that an eighth grade class should be able to 
solve problems as difficult as those used in the test, and that 
if the test is administered properly, carelessness will not be 
very evident in the returns. The question, therefore, is 
how to re-teach the number combinations in such a way 
that they will be thoroughly known by every member of the 
class. It is not the province of the present work to discuss 
methods in any extended way, but merely to show the use of 
standard tests. If this test has revealed the defect correctly, 
it is then the teacher's problem to become acquainted with 
the methods which will enable her to teach the addition com- 
binations. In brief, one of the best courses in arithmetic^ 
indicates three steps in teaching the addition combinations: 
first, the mastery of the 45 elementary combinations and their 
reverses; second, carrying these same combinations up 
through the decades and drilling on the same until proficiency 
is obtained ; and third, column addition. 

If the teacher does not understand the details involved in 
these three steps, she will of course need to become acquainted 
with them. In discovering exactly what a particular pupil's 
difficulty is, the teacher will find it exceedingly valuable to 
have the pupil take one of the test examples at the board and 

^ The Connersville Course of Study in Elementary Mathematics. Re- 
published by Warwick and York (in press). 
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proceed with his work orally. This will enable the teacher 
to follow and accurately observe the pupil's mental processes. 

The above discussion emphasizes the fact that drill alone 
is not the most important consideration. The first duty 
of the teacher is to discover the difficulties of individual 
pupils. Then pupils can be grouped according to common 
difficulties. In this connection it may be mentioned that the 
Boston score, which stands well at the top, is a result obtained 
after careful procedure in diagnosis and correction, followed 
by needed drill, according to directions similar to the above, 
for a period of three years. Equally satisfactory results have 
been obtained in other cities where superior skill has directed 
the work in the mechanical phases of the fundamental pro- 
cesses. For example, the results obtained in an Indiana 
city ^ under a teacher who helped in making the Connersville 
course of study in arithmetic and was interested in the first 
use of the Courtis tests in that city, are not only much above 
the Indiana median, but they are even above the Boston 
average. These results were obtained by (i) systematizing 
the drill for the class as a whole, and (2) discovering the 
difficulties of individual pupils and giving the necessary 
specific help. All agree that drill, to be effective, must be 
intelligently systematized, and given at frequent intervals. 

Retesting. — After the teacher has worked with her 
pupils faithfully, as individuals and as a class, she will want 
to retest the class in order to measure the results of her 
efforts. This may be done at any time, and the results will 
interest the members of the class fully as much as the teacher. 
The rules must be observed carefully in order that the test 
may be real and in order that comparisons may be valid. 

Table 19 shows the results of a retest of the 35 pupils 
shown in Table 16, after four months of careful work. The 
table shows that the pupils are still widely scattered in 
ability, but that the class as a whole has made good improve- 

^ Connersville, Indiana. 
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ment. The median of attempts has been raised from ii to 
1 2, and of rights from 7 to 10. The improvement in accuracy 
from 63.6% to 83.3% is very satisfactory. Because of in- 
dividual differences, a teacher may expect wide variations in 
speed within an eighth grade class, but she should not be 

Tabu: ig. — Showing Distribution of the 35 Eighth Grade Pomls 
OF Table 16, in Speed and Accuracy in Addition, apter Special 
Help and Drill (January) 

Score in Examples Attempted (Speed) 







A 


^ 


L 


— 


1 


1 


_ 


1_ 


- 


!1 


« 


„ 


.7 


,. 


t™. 














g 




I 


































W 




< 


































fi" 




4 


































s 




6 


































¥ 




8 
9 












■ 


! 


I 


2 
3 


; 


• 










3 
5 
8 
5 
6 


a 




13 




















I 


I 


f 


f 






4 


,? 




14 


























'- 






' 


s 




Totals 








' 


' 


3 


6 


7 


S 


4 


" 


^ 


1 







Median Attempts 
(Speed) — Ii 

content until pupils in an upper grade are letter perfect in 
solving simple examples in the fundamental processes, i.e. 
imtil 100% accuracy is reached. 

Since the results in addition as shown in Table 16 were 
secured in September, and the results in Table 19 in January, 
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and since the standards as brought out in Tables 17 and 18 are 
based upon May tests, the teacher of the grade whose results 
are shown in Tables 16 and 19 may reasonably expect that 
her pupils will be up to standard when she gives the tests in 
May. 

In Figure 6 the class improvement in rights from Septem- 
ber to January is shown graphically. This graph comes 
directly from the rights in Tables 16 and 19. 
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Fig. 6. — Showing attainment of the class in September (single line 
graph) and in January (double line graph). The entire class has moved 
steadily to the right, which means an improvement in score. 

Other Arithmetic Tests. — Other available tests at the 
present time are the Stone Reasoning Tests, the Woody 
Arithmetic Scales, the Boston Tests in Addition of Fractions, 
the Cleveland Survey Arithmetic Tests, and the Monroe 
Diagnostic Tests in Arithmetic. These tests will in turn be 
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described briefly, more particular attention being given to 
the Stone Reasoning Tests and the Woody Arithmetic Scales, 
as these are needed to supplement the Courtis tests, and can 
be used by teachers without particular difficulty. It is 
evident that what can be measured in arithmetic depends 
somewhat on the test being used. In general it is the per- 
formance of the pupils which is tested, and this may include 
speed and accuracy, or accuracy only, according to how the 
tests are administered. The purpose in arithmetic, as in all 
testing, should be to find out the present condition of the child, 
in order to prescribe remedies in tase he needs help, or in 
order to release him from further drill, in case he is fully up 
to reasonable standards. 

Reasoning Tests. — When arithmetic is put to practical 
business use, it is always connected with an actual situation, 
and the solution requires judgment or reasoning as to the 
processes involved. For upper grade work no test of arith- 
metic is complete which fails to test reasoning ability. The 
Stone Reasoning Test has been most used. It consists of 
twelve problems, ranging in value from i to 2, as follows : 

THE STONE REASONING TEST 

(Time Exactly 15 minutes) 
School Grade Name of pupil 



Problem 
Value 



I.O 



I.O 



Problems 



Solve as many of the following problems as you have 
time for ; work them in order as numbered : 

1. If you buy 2 tablets at 7 cents each and a book for 65 

cents, how much change should you receive from a two- 
doUar-biU ? 

2. John sold 4 Saturday Evening Posts at 5 cents each. He 

kept I the money and with the other J bought Simday 
papers at 2 cents each. How many did he buy ? 
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THE STONE REASONING TEST {Continued) 



Problem 
Value 



I.O 



I.O 



I.O 



1.4 



1.2 

1.6 



2.0 



2.0 



2.0 



2.0 



Problems 



3. If James had 4 times as much money as George, he would 

have $16. How much money has George? 

4. How many pencils can you buy for 50 cents at the rate 

of 2 for 5 cents? 

5. The imiforms for a baseball nine cost $2.50 each. The 

shoes cost $2 per pair. What was the total cost of uni- 
forms and shoes for the nine? 

6. In the schools of a certain city there are 2200 pupils; J are 

in the primary grades, J in the grammar grades, J in the 
high school and the rest in the night school. How many 
pupils are there in the night school ? 

7. If 3 J tons of coal cost $21, what wiU 5} tons cost? 

8. A news dealer bought some magazines for $1. He sold them 

for $1.20, gaining 5 cents on each magazine. How many 
magazines were there ? 

9. A girl spent \ of her money for car fare and three times as 

much for clothes. Half of what she had left was 80 cents. 
How much money did she have at first ? 

10. Two girls receive $2.10 for making buttonholes. One 

makes 42, the other 28. How shall they divide the 
money? 

11. Mr. Brown paid one third of the cost of a building; Mr. 

Johnson received $500 more annual rent than Mr. Brown. 
How much did each receive ? 

12. A freight train left Albany for New York at 6 o'clock. An 

express train left on the same track at 8 o'clock. It went 
at the rate of 40 miles an hour. At what time of day 
wiU it overtake the freight train if the freight train stops 
after it has gone 56 miles ? 



The papers are scored by giving to each problem solved 
correctly the value as indicated at the left of each problem 
in the above. The test was first formulated for upper sixth 
grade pupils, but it is equally good for seventh or eighth 
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grade pupils. It is too difficult for good results in grades 
below the sixth. 

Dr. Stone has recently issued^ the following grade 
standards : 



Gkade 



5 
6 

7 
8 



Standard 



Score of 5.5, reached or exceeded by 80%, 75% accuracy. 
Score of 6.5, reached or exceeded by 80%, 80% accuracy. 
Score of 7.5, reached or exceeded by 80%, 85% accuracy. 
Score of 8.75 reached or exceeded by 80%, 90% accuracy. 



It is quite probable that the median scores secured through 
the use of the Stone reasoning tests in various surveys form a 
more usable standard than the one suggested by Dr. Stone. 
These scores are shown in Table 20. 

Table 20. — Showing Median Scores Obtained in the Use of the 

Stone Reasoning Tests 



Grade 


Stone 1908 

26 CiTIfiS 


Butte, 
Mont. 1914 


Salt Lake 
City 191 5 


• 

Boston ] 
1916 


5 




2.2 


3.7 




6 


5.5 


3.9 


6.4 


4.0 


7 




5.8 


8.6 


6.4 


8 




7.7 


10.5 





Brookline 
Mass. 



4.0 
6.2 




Nassau 

Co.N.Y. 

1917* 



4.5 
7.2 



The teacher will find it worth while to use the Stone reason- 
ing tests, although the standards are not so definite as for 
the Courtis tests in the fundamentals. It will be simpler 
to take the returns from a single city, as for example. Salt 

^ Stone, C. W., "Standardized Reasoning Tests in Arithmetic and How to 
Use Them." (Teachers College Bureau of Publications.) 
* The scoring is such as to slightly raise the score. 
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Lake City, as a standard. If pupils fail to reach the Salt 
Lake City standard, they are not doing as well as pupils 
have done in an average city system. 

Diagnostic Tests. — The teacher who has followed the 
discussion closely will appreciate the fact that the Courtis 
tests, while measuring ability, do not analyze the difficulties 
and do not permit the teacher to use them easily for analyzing 
a pupil's shortcomings. This defect of the Courtis tests is 
being overcome gradually by the formation of other tests 
which have better diagnostic possibilities. Among these are 
the Woody Arithmetic Scales. 

Woody Scales. — The Woody scales were not originally 
designed for diagnostic purposes, but they are being made to 
serve that purpose, as well as their original purpose of measuring 
the ability of children. They are constituted quite differently 
from the Courtis tests. Each Courtis test consists of a series 
of problems of equal difficulty in one of the fundamental 
processes. The Woody scales consist of a series of problems 
of increasing difficulty. They are designed to measure work 
in the four fundamental operations: addition, subtraction, 
multiplication, and division. While constructed on a statistical 
basis rather than for the purpose of serving as the basis of an 
analysis of subject matter needs, yet at the same time they 
do cover subject matter reasonably well. The addition scale, 
iqx instance, covers simple combinations in one, two, three, 
and four column addition ; examples with addends from two 
to sixteen ; addition of simple fractions ; addition of decimals ; 
addition of U. S. money ; addition of denominate numbers ; 
and addition of mixed numbers. The additions are ex- 
pressed in column form and by the plus sign. Thus the 
pupil is tested, more or less, over the entire range of addition 
possibiUties, by a series of problems ranging in difficulty 
from those so simple that any third grade pupil may solve 
them, up to other problems so difficult that few eighth grade 
pupils succeed in solving them. This will appear by exami- 



The Measurement of Arithmetic 79 

nation of the addition scale which follows herewith. The 
subtraction, multiplication, and division scales also follow. 

SERIES A^ 

Addition Scale 

(20 minutes) 

Name 

When is your oext birthday? How old will you be?. . . . 

Are you a boy or girl ? In what grade are you? 

(i) (2) (3) (4) (5) (6) (7) (8) (9) 
2 2 17 53 72 60 3+1= 2+5+1= 20 

3 4 _2 45 26 37 10 

30 
£5 

(10) (11) (12) (13) (14) (is) (16) (17) (18) 

21 32 43 23 24+42= 100 9 199 2563 

33 59 I 25 33 24 194 1387 

_3S £7 2 16 45 12 295 4954 

13 201 15 156 2065 

46 19 

(19) (20) (21) (22) (23) (24) (25) 

$ .75 $12.50 $8.00 547 i+i= 4-0125 f+f+i+i=« 

1.25 16.75 5-75 197 1-5907 

■49 15-75 2.33 685 4.10 

4.16 687 8.673 

.94 456 

6.32 393 

525 
240 

152 

* The scales are printed in large type, on separate sheets, 8J"Xii", with 
ample space for the insertion of answers. 
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(26) (27) (28) (29) (30) (31) (32) 

i2i i+i+i= H-i= 4I 2* 113-46 l+i+i= 

62^ 2i 6f 49.6097 

i2i Sj 3f 19-9 
37t 9-87 









.0086 










18.253 
6.04 




(33) 

.49 
.28 

.63 


(34) 


(35) 

2 ft. 5 in. 

3 ft. 5 in. 

4 ft. 9 in. 


(36) 

2 yr. 5 mo. 

3 yr. 6 mo. 

4 yr. 9 mo. 


(37) 

i6i 

I2i 
2li 


•95 
1.69 






5 yr. 2 mo. 

6 yr. 7 mo. 


32I 


.22 










•33 
.36 










I.OI 










•56 

.88 

•75 
•56 


25.091 + 


(38) 

100.4+25+98. 


28+19.3614 = 




1. 10 










.18 










•56 











SERIES A 

Subtraction Scale 

Name, 

When is your next birthday? How old will you be?. . . . 

Are you a boy or girl? In what grade are you? 

(i) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) 

8 6 2 9 4 II 13 59 78 7-4= 76 
S o I 3 4 7 8 12 37 60 
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(12) (13) (14) ds) (16) (17) (18) (19) (20) 

27 16 50 21 270 393 1000 567482 2f — 1 = 

__3_ _9 JS __9 i90 278 _537 106493 

(21) (22) (23) (24) (25) (26) 

10.00 3i— i= 80836465 8^^ 27 4 yd. I ft. 6 in. 

3-49 49178036 5f i2f 2 yd. 2 ft. 3 in. 

(27) (28) (29) (30) 

5 yd. I ft. 4 in. 10—6.25= 75f 9.8063—9.019 = 

2 yd. 2 ft. 8 in. 52^ 

(31) (32) (ss) (34) (35) 

7.3—3.00081= i9i2 6mo. 8 da. A-Tir= 6i 3|-if = 

1910 7 mo. 15 da. 2^^ 

SERIES A 

Multiplication Scale 

Name 

When is your next birthday? How old will you be?. . . . 

Are you a boy or girl? In what grade are you? 

(i) (2) (3) (4) (5) (6) (7) 

3X7= 5X1= 2X3= 4X8= 23 310 7X9 = 

(8) (9) (10) (11) (12) (13) (14) (15) 
50 254 623 1036 5096 8754 165 235 
_3_ __6 _7 8 6 8 _4o _23 

(16) (17) (18) (19) (20) (21) (22) 
7898 145 24 9.6 287 24 8X51 = 
9 206 234 4 .05 2^ 

(23) (24) (25) (26) (27) (28) (29) 
iiX8= 16 Jxf= 9742 6.25 .0123 iX2 = 

2f 59 3-2 9-8 

G 
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(30) (31) (32) (33) (34) 

249 TlXif= 6 dollars 49 cents 2^X3^= ixi= 

36 8 

(35) (36) (37) (38) (39) 
987I 3 ft. 5 in. 2iX4iXii= .0963! 8 ft. 9i in. 

25 5 »o84 9 

SERIES A 

Division Scale 

Name 

When is your next birthday? How old will you be?. . . . 

Are you a boy or girl ? In what grade are you? 

(0 (2)_ (3^ (4)_ (S) (6)_ 

3)6 9)27 4)28 1)5 9)36 3)39 

(7) m_ i9)_ (10) (11) (12) 

4-5-2= 9)0 1)1 6X?=30 2)13 2-S-2 = 

(13) (14) (15) (i6)__ (17) 

4)24 lb. 8 oz. 8)5856 ^ of 128= 68)2108 50-^7 = 

(18) (19) (20)_^ (2l)_^ (22) 

13)65065 248-^7= 2.1)25.2 25)9750 2)13.50 

( 23) (24) (25) (26) 

23)469 75)2250300 2400)504000 12)2.76 

(27) (28)_ (29) (30) 

J of 624= .oo3).0936 3i-^9= i-5-5 = 

(31) (32) (33) 

• _ 

iH= 9|-5-3l= 52)3756 

(34) (35) (36) 
62.son-iJ= 531)37722 9)69 lb. 9 oz. 
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Directions for Giving Woody Tests. — The directions for 
administering the Woody scales accompany the sheets, 
which may be secured from Teachers College Bureau of 
Publications. It is quite essential that uniform methods be 
followed in order to make results comparable. The papers 
are distributed with face down. When pupils are ready with 
pencils in hand, they are told to turn over the paper and 
answer the questions at the top of the page. The specific 
directions for the addition test as given by Dr. Woody are as 
follows : " Every problem on the sheet which I have given 
you is an addition problem, an ' and problem.' Work as 
many of these problems as you can and be sure you get them 
right. Do all of your work on this piece of paper and don't 
ask anybody any questions. Begin." 

For the series A scales, twenty minutes are allowed for 
each test. There are shortened scales, series B, which are 
given in ten minutes each, but since the purpose in using the 
Woody scales will doubtless be to benefit more or less by their 
diagnostic values, it is assumed that teachers will prefer to 
use the longer scales of series A. It may be noted at this 
point that the time for giving the tests has been varied.^ 
While a shortened time gives slightly better distributions, 
particularly in the upper grades, yet the problems at the upper 
end of the scales are so difficult that few pupils will solve 
them even when given all of the time necessary. As a 
matter of fact, 20 minutes, the time allowed, is sufficient for 
most upper grade pupils to complete any one of the tests. 
The result is that in the upper grades, accuracy only is 
measured. But in using the Woody scales, it is likely that 
accuracy is the thing in which the teacher will be chiefly 
interested. 

In using the other Woody scales, the directions are the 
same as for addition except the substitution of such expressions 
as " subtract or ' take away ' problems " ; " multipUcation or 

* The Nassau County Survey used i8 minutes instead of 20. 
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Table 21. — Answers to Problems in Woody Scales 



Probt.km 


Addition 


Subtraction 


MUTLTIPLICATION 


Division 


I 


5 


3 


21 


2 


2 


9 


6 


5 


3 


3 


19 


I 


6 


7 


4 


98 


6 


32 


5 


5 


98 





69 


4 


6 


97 


4 


1,240 


•13 


7 


4 


5 


63 


2 


8 


8 


47 


150 





9 


87 


41 


1,524 


I 


10 


89 


3 


4,361 


5 


II 


108 


16 


.8,288 


6j not 6+1 


12 


59 


24 


30,576 


I 


13 


64 


7 


70,032 


6 lb. 2 oz. not 6+: 


14 


67 


25 


6,600 


732 


IS 


425 


12 


5,405 


32 


16 


79 


80 


71,082 


31 


17 


844 


215 


29,870 


^\ not 7+1 


18 


10,966 


463 


5,616 


5,005 


19 


$2.49 


460,989 


38.4 


35f not 35+3 


20 


$45.00 


it 


14.35 


12 


21 


$27.50 


6.51 


60 


390 


22 


3,873 


3 


46 


6.75 


23 


f 


31,658,429 


10 


20^ ; 20.3, no 
20+9 


24 


18.3762 


3i 


42 


30,004 


25 


2, not V nor '{ 


I4f 


ii 


210 


26. 


125, not I23i = 2 


I yd. 2 ft. 3 in. 
not 63 in. 


574,778 


.23 


27 


J 


2 yd. I ft. 8 in. 
not 81 in. 


20,000 


546 


28 


I not J nor \ 


3} or 3.7s 


.12,054 


31.2 


29 


12I not iif =ij 


23 J not 23|=J 


Jnotf 


A 


30 


i2|not iiy = if 


.7873 


89.64 


A or .15 


31 


217.1413 


4.29919 


^TT 


s 


32 


ijnot f nori}=J 


I yr. 10 mo. 23 


$51.92 or SI dol. 






da. 


92 cts. 




33 


10.55 


H 


8f 


72^ or 72.23 


34 


if 


3} not 3f =i 


i 


50 


35 


10 ft. 8 in. or loj 
ft. 


2I not 2}= j 


24,693? 


71TI7 or 71.04 


36 


22 yr. 5 mo. or 
22A yr. 




17 ft. I in. 


7 lb. 1 1} oz. 


37 


82H 




isA 




38 


268.13^4 




.0080902^ or 
.00809025 




39 






79 ft. i\ in. 
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KOLVED 




Totah 










/ 






/ 






nil 




10 


II 
1 




11 


III 




II 


mill 




IJ 


II 




14 


iniiiii 




TS 


III 




16 


", 




18 


I 




10 






5i 






a4 












26 






27 






aS 






29 






30 






31 






32 






33 






34 






35 






36 







Class Median 13 
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* times ' problems " ; " division or ' into ' problems," for 
" addition or ' and ' problems." In case the teacher has used 
other expressions to indicate one of the processes, these may 
be substituted for the expressions " and " problem, etc. 
The purpose in using these extra expressions is to make clear 
to the child the process which is involved in the particular 
test. 

The standard for marking the examples in the Woody scales 
is absolute accuracy, and the final answer should be in its 
lowest terms. The table on the opposite page gives the cor- 
rect list of answers. 

The method of tabulating the results of the Woody tests is 
very simple. Assuming that a test has been given, indicate 
on the upper corner of each page the number of problems 
solved correctly. Then, for convenience, arrange the papers 
in order according to the number of problems solved. With 
the papers thus arranged, it will be possible to draw off di- 
rectly the results of the test as shown in Table 22. This table 
shows the distribution of a class of 35 with reference to the 
number of problems solved by the different members. 

Table 22 is taken directly from the results of a division test 
given in an intermediate grade,^ November i, 191 7. The 
distribution of pupils' scores resulting from giving the Woody 
Test in Division, Series A, is shown for the entire school 
system in Table 23. 

This is the same form as shown in Table 22, except that it 
covers five grades, and the number of pupils in each grade is 
the complete number for the entire city. The superintendent 
of this school system has shown, exceptionally well, the 
diagnostic possibiHties of the Woody scales. In the study 
referred to, he analyzes the division difficulties of pupils as 
shown by the errors they have made in attempting to solve 
the problems in the division scale. It will be well for the 

^ Anderson, C. J., "Use of Woody Scales for Diagnostic Purposes," Elementary 
School Journaly 16 : June, 1918, pp. 770-781. 
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Table 23. — Distribution of Pupils' Scores 



Number of Problem 



1 . . . 

2 . . . 

3 . . . 

4 . . . 

5 . . . 

6 . . . 

7 . . 

8 . . . 

9 . . . 

10 . . . 

11 . . . 

12 . . . 

13 . . . 

14 . . . 

15 . . . 

16 . . . 

17 . . . 

18 . . . 

19 . . . 

20 . . . 

21 . . . 

22 . . . 

23 . . . 

24 . . . 

25 .... 

26 . . . 

27 . . . 

28 . . . 

29 . . , 

30 . . . 

31 . . . 

32 . . . 

33 . . . 

34 . . . 

35 . . . 

36 . . . 

Total 
Median 



Grade 



IV 



I 

3 
I 

II 

7 

5 
12 

II 

8 

13 

5 

3 

I 

2 



84 

12 



V 


VI 


vn 


I 

• 






I 

I 




» 


2 






9 


2 




7 
6 


3 




9 


I 


I 


3 


2 




2 


2 




7 


4 




10 


4 




2 


4 




12 


2 


2 


6 


• 8 


2 


9 
8 


6 
9 


I 
6 


2 


12 


I 


I 


II 


3 


I 
I 


9 

5 


2 
8 




3 


3 




3 


9 




I 


5 
5 
9 




I 


15 
5 




I 


2 

3 
I 


100 


93 


83 


17 


22 


29 



vm 



I 
I 
I 



I 
I 
I 
2 

3 

3 

4 

3 
6 

3 

5 
2 

I 



40 
30-5 



88 How to Measure 

teacher to summarize for her class the number of wrong 
solutions for each problem attempted. This can be shown 
for a single test by a table similar to Table 22, in which the 
problems are listed by number on the left, and the nxmiber of 
incorrect solutions shown on the right. In the final analysis, 
however, the teacher should study each paper to see what 
mistakes each particular pupil made. This should be done 
in each of the fundamental processes. If the Woody scales 
are used to supplement the Courtis tests for the purpose of 
finding out where the pupil made his mistakes, it will be found 
exceedingly valuable. The various types of errors made in 
division in the city referred to were summarized by the 
superintendent and his teachers as follows : 

1. Ignorance of multiplication tables, 30 per cent. Illustra- 
tion: 8,107 

8)p5"6 

2. Using dividend as a whole, 14 per cent. Illustration: 

3)3? 
12-3 

3. Confusion of multiplication and division, 14 per cent. 
Illustration: 3)39. 

93 

4. Remainder, 10 per cent. Illustration : 6f 

2)13 

5. Confusion of signs, 7 per cent. Illustration: 2-7-2=4. 

6. Form of example strange, 5 per cent. Illustration : ^ of 

128. 

7. Carrying (either forgetting to carry or ignorance of what 
shoiild be carried), 5 per cent. Illustration: 2 )1.350 . 

620 

8. Value of o, 5 per cent. Illustration : 9)0 1)1 

9 o 

9. Confusion of addition and multiplication, 5 per cent. 
Illustration: 3)6. 
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10. Confusion of dividend and divisor, 2 per cent. Illustra- 
tion: 8 )498 . 

212 
(This quotient is explained as follows: 4 into 8 = 2, 8 into 9 
I and I over, 8 into 18 = 2 — 2 over.) 

11. Using some figure in dividend twice, 2 per cent. Illus- 
tration: 8 )5,856 . 

7,107 

12. Transposing answer, i per cent. Illustration : ^ of 128 = 23. 

The teacher with her particular grade should proceed in a 
similar manner, taking up each fundamental process and dis- 
covering the types of errors made. It will be well to note, 
not only the errors made, but after each, the names of the 
pupils making that particular mistake in order that she may 
give special attention to all of the pupils making a particular 
mistake. Suppose, for example, that one of the teachers in 
the above city, on a certain day, desires to work upon the 
fourth one of the listed errors, namely inability to handle the 
remainder. In an intermediate class of 35 pupils, she may 
have four who need help on this point. By referring to her 
paper she will be able to call the names of the four who need 
special instruction. So with each particular mistake, she 
will be able to call for the pupils who need help, permitting 
others in the class to spend their time in some other way. 

The superintendent and his teachers in the city referred to, 
noted that long division was difficult for the pupils, and so made 
a special summary of the errors in long division, as follows : 

1. The assumption that the first integer of the divisor may 
be used always as a trial divisor. 

2. The trial-and-error method of finding quotient. 

3. Ignorance of multiplication tables. 

4. Carrying wrong number when multiplying. 

5. Borrowing in subtraction. 

6. Ignorance of value of cipher. 

7. Forgetting to place integers in quotient. 
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This is a good illustration of the diagnostic use of tests 
as a basis for remedial instruction. Such use of tests makes 
them of direct service in the work of helping pupils, and this 
is a use that must in future receive more and more attention. 

The Woody tests are being quite extensively used. During 
the year 1917-1918, 300,000 copies of the tests were used by 
school men in the United States. This extensive use is 
gradually developing standards of performance. Instead of 
giving standards alone, it will be more helpful to list the 
returns from a number of cities. Accordingly there are given 
in Table 24 the median scores secured in the use of the Woody 
scales in twenty Wisconsin cities, as well as the Woody stand- 
ard medians. 

Table 24. — Median Scores by Cities 



w... 


TSBIKD 


SEcnON 


ADomoN 


SDsniAcnoM 


Cnm9 




ra 


— 


\i. 


VII 


vm 


ni 


16.3 
13.3 

■ 8.3 
iS.s 
■8..S 
.8.1 

17.3 
.84 
17.9 

9.8 


V 


VI 


vn 


vm 




lo/io/ifi 

'&% 
vit: 

3/=7/-T 
=/=6/<7 
3/S/17 

5/8/y 
4/10/17 
4/11/17 
6/4/17 
6/6A7 

5/ A; 


B 

B 

B 
A&B 
A&B 

B 

B 

B 

B 
A&B 
A&B 

A 

A 

A&B 

A 

A 
A&B 
A&B 

A 


II.7 
II.6 

jo,7 
15.3 

14.B 
10. 


.7.8 
17.7 

K).7 

":; 

.0.8 
i3.5 
■9-5 
17.S 

31.S 


«.7 
«.) 

23-3 
10.6 

26-4 
j6.8 


j8, 

30.0 
IJ. 

19.3 
13.8 
jfi.8 

j8,6 
30-8 


33. 1 

18.J 

35.4 
31 = 
3Q.8 

33- 
33-9 


31^6 

3:1^8 
33.4 
26.S 

3!-7 

33-a 
33.1 

31-8 
34.8 


10,7 

13 .8 
T3.6 

8.7 
17.8 


n.9 
ao.7 

.7.7 
34. 


14.4 
17^ 
i7-8 
IS.7 
3S. 
28.5 

184 

37-3 
2S.8 


'7-T 

a6.S 
J7.6 

30- a 
=7.4 
30-6 

a8-7 

"■' 

147 
30.S 


30.8 
30.S 
31.S 

31.6 

28.1 

30.1 


Median 

Woody's Standard Median 


IS.S 


rii 


J3' 


ja.4 

jg.8 


3i.4 


34- 


;,:' 


iS.r 
15-7 


3Q.8 
30.4 


IS,6 
=5- 


>8.s 


31 7 
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Table 24 {Continued) 



„».™„™ 


DivmoM 




10/ we 

1/0/17 

'!"!& 

5/4/ir 
S/8/.7 

6/4/17 
6/6/17 
5/ At 


6 

B 

B 
A&B 
A&B 

B 

B 

B 

B 
A&B 
A&B 

A 
A&B 
A&B 

A 
A&B 

A"&B 

A 
A 


J. 7 

%.i 

6.6 
B.O 
8.5 

7-f 

11.3 


13-7 

14-J 

17.8 

13.8 

IS-B 
'5-5 

I4-" 

30.6 


19.8 

18.8 
18.5 
18,0 
16.1 
iB,o 

184 


36.g 
35.S 

284 
1S.6 
38.3 

J6.3 

J 7-3 

J4.S 
16.5 


30,3 
35-7 

30.5 

31. 

30-4 

30 .i 

35.1 


31.8 

31.3 
33.6 

34- 
20.6 

3'-7 
33-B 


S-6 
&. 

6-7 
.4.8 


I 

14-3 

11.6 
0-8 

iB.S 


17-8 

M.S 

10.8 
18.I 
16. 

■6.S 
i6,B 
15-2 

17.7 
14.7 


35 = 

3 J. 6 
17-6 

'5-4 
.8.S 
i3.3 

"-S 

.84 

38".i 

33.8 


ig. 
38.3 
!8.S 

38.8 
14.6 

ig. 

14.7 
33-S 

28.4 


30.» 

30.7 

'B-9 

3SS 

377 

3I.S 
3S.3 
28.7 
3S-= 

38.1 
13-3 

316 

3= -8 


Median 

Woody'a Standard Median 


" "-.; 


18.3 


iM 


30-6 


33i 
31.0 


7.6 
S-8 


iJ-S 


.6:5 


30. 



Boston Research Tests in Fractions 

These tests are not generally available and are given in this 
connection to suggest to the teacher the possibility of becoming 
keen and active in the work of discovering pupils' errors. The 
Boston test in addition of fractions consists of six simple 
tests of four problems each, each test having a two-minute 
time limit. They cover the various types of problems in the 
addition of fractions, and they increase in difficulty from the 
first example in which the denominators are the same, up 
to the last in which the common denominator can be deter- 
mined with difficulty by introspection. The test follows 
herewith : 
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Addition of Fractions 

Showing Examples Used in Tests in Addition of Fractions, 

December, 191 5 

Test I. — Time, 2 minutes. 



(l) k 
i 


(2) A 


(3) A 

7 
16 


(4) A 

7 
10 


Test 2. 


— Time, 2 minutes. 








(i) \ 
i 


(2) \ 

A 


(3) 


2 

¥ 

1 
12 


(4) -J 

7 
15 


Test 3. ■ 


— Time, 2 minutes. 




* 




(i) f 


(2) 1 

1 


(3) 


5 

T 


(4)H 
1 


Test 4. 


— Time, 2 minutes. 








(i) + 


(2) i 


(3) 


3 
T 
3 


(4) * 
1 


Test 5. 


— Time, 2 minutes. 








(i) A 


(2) i 

12 


(3) 


1 
¥ 
3 
¥^ 


(4) A 
A 


Test 6. ■ 


— Time, 2 minutes. 








(i) i 


(2) % 

3 

¥ 


(3) 


1 
¥ 
9 
10 


(4) A 

7 

lb 



The directions for scoring the test are not available, and 
without such directions definite comparison cannot be made. 
It seems worth while, however, to indicate the city medians 
for Boston, and these are sxunmarized herewith in Table 25. 
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Table 25, — Summary Sheet — Cirv Medians (Boston) 
Addition of Fractions, December, 1915 





1 


1^.. 


TtBTJ 


T.„, 


Test 4 


T.„. 


Test 6 






1 


g 


1 


1 


1 


J 


1 


J 


1 

5 


s 


S 


Gradi: 


sS 




?i 




s 





K 




S 




P. 







1 


M 


H 


1 


1 


1 


1 

47.0 


6.0 


68.0 


1 

6.5 


^2.0 


I 
6.4 


1 


VIII . . 


11,10 


20.7 


88.0 


II.6 


74.0 


8.4 


47.0 


VII. . . 


1^4,^ 


16..^ 


8;.o 


lO-I 


7,1.0 


7-.S 


46.0 


<r?. 


&Q.0 


6.., 


=i.';.o 


S-7 


4K.0 


\I . . . 


.2^5 


10.7 


80.0 


7.7 66,0 


S-5 


42.0 


« 


70,0 


4.6 


51.0 


44 


49.0 



These tests as given in the Boston schools proved especially 
helpful in the work of analyzing the difficulties of pupils and 
devising drills to raise the efficiency of the children. This 
was evidenced by the increase in both speed and accuracy in 
tests given during the following spring to selected sixth grades. 
The teacher should find these tests in addition of fractions 
very useful, and she can make comparisons on the basis of 
her own rules for scoring. 

The Boston tests in the subtraction of fractions will be 
equally suggestive and helpful to the teacher who is attempt- 
ing to analyze the types of problems and the difficulties en- 
countered by children in the solution of problems in sub- 
traction of fractions. The tests as given follow herewith. 

Subtraction of Fractions 
Showing Examples Used in Tests in Subtraction of Fractions, 

December, 191 6 
Test I. — Time, 2 minutes, 
(i) i (2) ^ (3) « (4) A 

i i 4 -ft 



4 
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Tesn. 


— Time, 2 minutes. 






i 


(a) » 
i 


(3) i 


(4) f 

i 


Test 3. - 


- Time, 2 minutes. 








(2) 1 

A 


(3) 5 
A 


(4) A 
A 


Test 4. - 


- Time, 2 minutes. 






(i) 4 
'A 


(2) 6 
Si 


(3) 6 


(4) 6 
3i 


Test 5. - 


— Time, X minutes. 






W9* 
■1 


(2) 7* 
6? 


(3)7A 
4I 


(4) 7i 

»A 



These tests increase in difficulty from the first to the fifth 
test and some of the examples in tests 4 and 5 are as difficult 
as any likely to appear in actual social and business practice. 
Instructions for the scoring of these tests are not at hand, but 
nevertheless the summary of the Boston medians is given 
herewith in Table 26. 

Table 26. — Sommaky Sheet — City Mediants (Boston) 
Subtraction of Fractions, December, igi6 



II 



1283 
1499 
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Tests were also devised in the multiplication and division 
of fractions. Some of these tests are quite difficult and yet 
for diagnostic purposes they will show the ability of children 
to multiply or divide fractions and mixed numbers. The 
tests are indicated herewith, and city medians are summarized 
in Table 27. 

Multiplication and Division of Fractions 

Showing Examples Used in Tests in Multiplication and Division 

of Fractions, December, 1917 

MultipUcation of Fractions. — Test i. Time, 2 minutes, 
(i) iX6 (2) ixS (3) |Xi2 (4) 12 XA 

Multiplication of Fractions. — Test 2. Time, 4 minutes, 
(i) 246* (2) S73i (3) 275 (4) 4s6i (s) 189 

5 5 _8f _^ _si 

Multiplication of Fractions. — Test 3. Time, 2 minutes. 

(i) 4ixi (2) 7ixf (3) sixl (4) *X2f 

Multiplication of Fractions. — Test 4. Time, 5 minutes. 

(i) 32* (2) 84* (3) 29! (4) 2sf (s) i9i 

69i 79i 28i iy% gyj 

Division of Fractions. — Test 5. Time, 2 minutes, 
(i) 1^8 (2) 9-f (3) 6^t (4) 8^f 

Division of Fractions. — Test 6. Time, 4 minutes. 

(i) s678i■^s (2) 2789f-^4 (3) 2467-5-8! (4) 6752-^12* 

Division of Fractions. — Test 7. Time, 3 minutes, 
(i) l-i (2) 3*-i (3) 5^-^% (4) 6f -^t 

It will be observed, from Table 27, that the scores in tests 2, 
4, and 6 are very low, indicating that they were not well 
chosen. Referring to these particular tests. Dr. Ballou of 
the Boston bureau says: 
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"It is probably true that there is no great use for the type of 
work shown in these three tests in practical life, but the business 
world does require it to some extent ; business courses in our high 
schools require the processes, and the new course of study requires 
this work. In view of these three conditions, it was thought best 
to include these three tests in order that we might have some facts 
on which to base the development of our work in multiplication 
and division of fractions." 

Table 27, — Summary Sheet — City Medians (Boston) 
Multiplication and Division of Fractions 





s 


M„™„„o» 


D™o. 




Tat. 


■"• 


T«t3 


Tslt 


TC51S 


T«(6 


TBt7 




1 


1 


1 


i 


.3 


1 


1 


1 


1 


1 


1 


1 


1 


1 


ClADE 


i 

1 


if 


G 


E 


c 


3 






c 


s 


a 


S 




s 






M 


Is? 




1 


1 


1 


1 


1 


1 


1 


1 


1 


i 


1 


vni 


1027 


II I 


0.1 


8.8 


6,1 


7.6 


8.^ 


4-7 





10. 1 


7^ 


?■■^ 


20 


■o.,i 


70 


VII 


11Q6 


».4 


88 


1-1 


,18 


6.4 


81 


4. J 





S.2 


SI 


a.-) 





8.S 


68 


VI . 


1290 


6.! 


13 


8.3 


° 


4.7 


° 


5-6 


° 


54 


° 


3a 


° 


4-9 


•* 



On the basis of this statement we may expect that the 
course of study in Boston will be improved by much elimi- 
nation and better adaptatioir to business usage. Tests will 
be used more and more in the work of revising courses of 
study. 

Cleveland Survey Tests. — One of a nimiber of hopefxil 
signs in the development of arithmetic tests is the clear 
recognition that they should be of direct value in helping the 
children. This recognition is leading quite clearly to the 
more extensive development of diagnostic tests. The tests 



The Measurement of Arithmetic 97 

used in the Cleveland survey were prepared in cooperation 
with Mr. Courtis, who recognized as clearly as any one else the 
need of supplementary work in order to make his standard 
tests, series A and B, of sufficient value to the teacher whose 
duty is to improve the pupils in their work. No attempt will 
be made in this place to describe or discuss fully the Cleveland 
survey tests.^ The tests, now slightly revised, are composed of 
15 different sets of examples designated as A, B, C, D, E, F, 
G, H, I, J, K, L, M, N, O. They are intended to cover the 
" fundamentals " of arithmetic. Of the 15 tests, four are 
in addition ; two in subtraction ; three in multiplication ; four 
in division; and two in fractions. They constitute to an 
extent a spiral arrangement of tests, increasing in difficulty 
from A to O. The actual time covered by the tests is 22 
minutes, and this combined with the time necessary to pass 
from one test to another led to the direction, in connection 
with the Grand Rapids survey, that two days be taken for 
the tests ; the first nine sets being given on the first day and 
the remaining six sets being given on the following day. As 
indicated, the sets were devised in cooperation with Mr. 
Courtis and they follow the Courtis Practice Forms more or 
less closely. These forms were used in the Grand Rapids 
schools so that the results secured in Grand Rapids may be 
considered quite satisfactory. The results in the Cleveland 
schools were more satisfactory in the lower grades but a little 
less so in the upper grades. Table 28, following herewith, 
shows the average of the median scores in each of the arith- 
metic tests for grades 3 to 8 in Cleveland and Grand Rapids, 
This table may be considered as setting tentative standards 
for the Cleveland survey tests for the various grades. 

* For further discussion see Judd, C. H., "Measuring the Work of the Public 
School," a volume of The Cleveland Survey; School Survey of Grand Rapids, 
Mich., Chap. VI; "Arithmetic Tests and Studies in the Psychology of 
Arithmetic," by Coimts, G. S.; Supplementary Educational Monograph, 
whole number IV. 



98 



Eow to Measure 



Table 28. — Averages of Median Scores in Each Arithmetic Test 
FOR Grades 3 to 8, Cleveland and Grand Rapids Combined 



Set 



A 
B 
C 
D 
E 

F 
G 
H 
I 

J 

K 
L 

M 
N 
O 







Graixb 




8 


4 


6 


6 


7 


134 


17.I 


21.9 


24.9 


27.0 


8.9 


12.8 


16.6 


19-5 


21. 1 


6.5 


II.7 


14.8 


16.8 


18.2 


6.3 


II.4 


15.0 


17.7 


20.3 


4.3 


5-0 


5-9 


6.7 


7.4 


2.0 


4.5 


6.6 


7.7 


9.1 


2.0 


3.6 


5.1 


5.5 


6.0 






5.6 


6.0 


7.7 


0.6 


I.O 


1.7 


3.1 


4.0 


1.9 


3.0 


3.9 


4.4 


5.1 




4.0 


5.6 


7.0 


9.4 




1.7 


2.7 


3-2 


3.8 


1.4 


2.4 


3.4 


4.1 


4.7 




0.8 


I.I 


1.6 
3.3 


1.9 

4.3 



28.9 

25.8 
19.9 

22.8 

8.0 
10.6 

6.7 

8.6 

4.7 
6.1 

11.4 
4.4 

54 
2.4 

5.2 



The four addition sets, A, E, J, M, follow herewith, and they 
may be considered as representative of the spiral arrangement 
and diagnostic character of the Cleveland tests. It will 
be observed that the examples increase in difficulty and 

Set A — Addition. 
1690417932136 
2651237604589 






3 


8 


9 


7 


8 


2 


I 


4 


8 





2 


2 


7 


2 


I 


9 


6 





5 


6 


7 


9 


5 


7 


I 


4 


7 





3 


I 


2 


5 


6 


7 


5 


8 


6 


9 


6 


8 


8 


5 


4 


9 


8 





2 


I 


3 


5. 





4 


2 


9 


7 


4 


5 


7 


4 


8 





3 


9 


2 


3 


2 


3 


8 





2 


I 


9 


6 





4 


I 


8 
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S 


6 


2 


4 




5 


I 


6 




3 


7 


9 


4 


7 4 


3 


I 


8 




9 





2 




3 


4 


8 


^ 5 


SetE 


1 


\ddition. 




















5 


2 




9 




2 




6 




I 




4 


9 


2 


8 




8 




8 




3 




4 




6 


7 


2 


8 









5 




4 




2 




5 


I 





5 




7 









8 




5 




3 


5 


4 


I 




6 




6 




8 




4 




4 


3 


6 


2 




6 




8 




5 




4 




I 


3 


7 


7 




2 




5 




9 









4 


7 


8 


3 




3 




I 




6 




8 




I 


2 


5 


4 




9 




3 




3 




5 




8 


9 


S 


I 




3 




8 




8 




5 




4 


6 


Set J 


— Addition. 




















7 9 


4 


7 


2 


9 




6 


7 


7 


8 


9 


4 


3 2 


5 2 


5 


I 


9 


6 




9 


I 


8 





5 


3 


I I 


4 4 


8 


9 


4 


2 




6 


5 


5 


7 


3 


7 


7 6 


2 8 


I 


4 


8 


4 




7 


I 


4 


I 


4 


7 


6 6 


6 2 


4 


3 


S 


7 







4 


I 


8 


6 





9 1 


7 


8 


2 


I 


I 




4 


6 


8 


5 


2 


2 


6 8 


S 5 


5 


8 


S 


3 




3 


5 


2 


I 


3 


9 


3 6 


I 3 


I 


5 


2 


9 




7 


3 


I 


3 


9 


5 


4 9 


8 6 


3 


2 


4 


2 




I 


3 


3 


7 


2 


6 


5 7 


3 1 


9 


7 


3 


3 




6 


7 


9 


4 


2 


3 


4 S 


2 4 


6 


7 


6 


8 







6 


8 


9 


8 


4 


2 2 


9 8 


3 


I 


7 


5 




6 


I 


4 


4 


5 


8 


9 2 


9 8 


S 


9 


6 


5 




6 


7 


5 


4 


6 


8 


1 i 


SetW 


[ — 


Addition. 




















7493 




8937 




8625 




21 


23 




5142 




3691 


9016 




6345 




4091 




1679 




0376 




4526 


6487 




2783 




3844 




5555 




4955 




7479 


7591 




4883 




8697 




6331 




9314 




2087 


6166 




1341 




7314 




6808 




5507 




8165 
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5226 


9149 


6268 


9397 


7337 


8243 


2883 


8467 


7725 


6158 


2674 


6429 


2584 


0251 


8331 


3732 


9669 


9298 


0058 


7535 


5493 


4641 


5114 


7404 


2398 


5223 


3918 


7919 


8154 


257s 



lend themselves quite well to diagnostic purposes. Set A 
tests the pupils* knowledge of the addition combinations; 
set E is a simple test in column addition; set J involves 
more difficult column addition ; and set M requires carrying 
as well as column addition, and conforms to business usage 
more closely than the Courtis Series B. 

Subtraction is tested in sets B and F; multiplication in 
sets C, G, and L ; division in sets D, I, K, and N ; and frac- 
tions in sets H and O. In each of the fundamental processes 
and in fractions, the first set is quite simple and each later 
set grows more difficult. The detail shown above in addition 
represents the plan in each process. The diagnostic use of the 
Cleveland tests may best be illustrated by taking some actual 
cases. Mary S., a pupil in the eighth grade, thirteen years 
old, makes no mistakes in any of the tests in addition. She 
solves every problem attempted, but her score for the various 
tests in addition is low. They run as follows : A 19, E 5, J 4, 
M 4. The standards for these tests in the eighth grade as 
shown by the Cleveland and Grand Rapids reports (see Table 
28) are A 28.9, E 8.0, J 6.1, M 5.4. It appears, therefore, 
that Mary S. needs to continue her present accuracy in addi- 
tion but increase her speed. In subtraction she makes a 
mistake in test B in subtracting 9 from 11, but her score in 
subtraction is low in both test B and test F. In multiplication 
she is inaccurate as well as slow, making two mistakes in 
set C which are the simple combinations. In division she is 
exceedingly slow. In set K in division the standard score for 
the eighth grade according to Table 28 is 11.4. Mary S. has 
a score of i. 

Turning to the record of Hazel R., a thirteen-year-old girl 
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in the eighth grade, it appears that her scores are down 
throughout, but that her inaccuracies are chiefly in sets H 
and O. These tests, as will be noted above, relate to simple 
fractions. However, in addition, Hazel R. has difficulty. 
She solves twenty problems in set A all correctly and four 
in set E correctly, but in set J involving a colunm of thirteen 
figures she fails on every problem attempted. In set M, 
five addends of four-place numbers, she fails on one out of 
three attempted. 

It is evident, from the above, that rather detailed analysis 
of the pupils' difficulties is easily made from the results of the 
Cleveland tests. When pupils pass set A with proper speed 
and accuracy it means that they know the addition com- 
binations. When they fail on set J it means that the more 
complex numbers involve too much mental effort or that the 
drill on decades has not been sufficient, doubtless the latter, 
because many pupils who know that four and eight are twelve 
fail when the combination is twenty-four and eight. In like 
manner, a pupiPs paper will show for the other fundamental 
processes and simple fractions just where his difficulties be- 
gin and, therefore, just where the teacher needs to begin in 
order to give the necessary help. How to analyze the arith- 
metic difficulties in an entire city system through the use of 
the Cleveland survey tests has been shown by Dr. George 
S. Counts in the School Review Educational Monograph, 
number IV. 

Kansas Diagnostic Tests in Arithmetic. — The Bureau of 
Educational Measurement and Standards, of the State 
Normal School at Emporia, Kansas, has done a notable work 
in connection with the measurement of subject matter. The 
fact that the tests in arithmetic, finally put out by this 
Bureau, were diagnostic in nature, may be taken as a further 
indication that, in the future, increasing importance will be 
attached to diagnostic tests. The Kansas Tests in Arithmetic 
have not been extensively tried out and so are not well 
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standardized. Tentative standards, however, have been issued 
covering the 21 tests. These are the midyear scores and are 
based upon results of testing from 300 to 1200 pupils in each 
of the grades. Table 29, which follows herewith, gives these 
tentative standards for the first six tests. 

Table 29. — Tentative Standards for the Kansas Diagnostic 

Tests in Arithmetic 

The number of pupils taking the tests varies from about 3c» to over 
1200. Midyear scores. R — Rate or number of examples done. 
A — Accuracy or per cent of examples correct 



Grade 


IV 


V 


VI 


vn 


vin 


Test No. 


R 


A 


R 


A 


R 


A 


R 


A 


R A 


I 


7.8 


100 


12.3 


100 


lO.I 


100 


12. 1 


100 


II. 9 100 


2 


3-7 


60 


7.3 


100 


7.0 


ICX> 


8.0 


ICX> 


8.9 100 


3 


3-1 


57 


4.9 


75 


5.1 


79 


5.5 


83 


6.2 84 


4 


2.1 


40 


2.9 


60 


3-4 


68 


4.3 


79 


4.6 88 


5 


4.2 


52 


5-2 


64 


5-3 


63 


5-4 


63 


6.1 66 


6 


1.9 


38 


3-4 


70 


3.2 


74 


4.7 


70 


4-5 100 



These six tests cover simple work in addition, subtraction, 
multiplication, and division and are reproduced with the 
consent of the author. 



PART I — TESTS 1-6 
Operations with Integers 



At. 



Test I 


— Addition. 
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4 


I 9 
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3 
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I 
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At. 



Test 


2 — 


Subtraction. 










Rt. 


1 • • • 
' • • • 




37 
5 


94 
8 


60 
3 


27 
6 




39 

7 


41 

8 




77 
3 


53 
9 


65 

2 


"80 
4 


92 

5 


70 
3 




68 
2 


58 
9 




26 
9 


43 
8 


95 
4 


50 

7 


36 

I 


34 
8 




44 
6 


25 
3 




63 

7 


57 
9 


Test 


3 


Multiplication 


.• 








At. 
Rt. 


' • • • 
1 • • • 




6572 
6 




6750 
9 




5863 
2 




3754 
5 






2845 
8 


4936 
4 




9327 

7 




8274 
3 




8409 
6 






6391 
9 


5482 
2 




8609 

5 




3679 
8 




2758 
4 






4658 

7 


9653 
3 




3174 
6 




2874 
9 




7901 
2 






2179 
5 


Test 


4 


Division. 










At. 
Rt. 


■ • • 
1 • • • 




8)3840 




4)7432 




7)3534 




3)9430 






6)4680 


9)^577 




2)6370 




5)9310 




8)7512 






4)3820 



7)9653 



3)5781 



6)6720 



9)5373 



2)5130 
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At. 



Test 5 — Addition. 



Rt. 



64)4992 



92)6624 



24) icx>8 



7862 


6809 


8941 


5917 


6772 


7864 


1249 


5013 


7623 


7910 


4814 


6028 


7883 


8975 


I76I 


5299 


9845 


9007 


6535 


8240 


900s 


5872 


6601 


8522 


6975 


2340 


8969 '* 


1573 


3739 


3496 


1046 


1227 


2319 


6794 


3203 


8758 


2462 


1247 


4319 


6794 


3293 


7917 


2350 


9869 


3573 


2358 


5420 


7805 


4304 


3197 


4572 


1081 


5795 


4570 


7642 


9027 


2338 


6420 


7805 


4314 


8028 


7803 


9975 


5917 


6772 


9864 


1249 


8758 


2462 


1247 



Test 6 - 


- Division. 




At.. 
Rt.. 




82)3854 


43)1591 


74)2664 




31)1953 


63)3591 


94)4042 


21)1407 




53)4452 


83)5312 


42)672 


71)5183 




32)2304 


62)2108 


93)5022 


23)874 




51)2703 


84)7140 


41)3567 


73)6278 




33)1386 



52)3484 



The value of these tests can be ascertained only by extended 
use. They follow rather closely the line of testing marked 
out by the Cleveland survey tests. Tests 7 to 11 cover 
operations with integers in the fundamentals and are similar 
to tests I to 6, except that they are more difficult. Tests 
12 to 16 cover operations with common fractions. These 
tests are simple and not beyond reasonable business demands. 
They should be of value in helping teachers to locate a pupil's 
difficulties in handling fractions. Tests 17 to 21 deal with 
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the multiplication and division of decimals and relate par- 
ticularly to the problem of placing the decimal point. There 
will quite surely be objections to these tests because the 
examples in the tests are more difficult than problems in 
decimals which appear in common business practice. It is 
apparent that the test in decimals has been based upon an 
analysis of textbook material rather than actual usage. 

The Teacher's Problem. — In view of the many tests in 
arithmetic which are now available, the teacher may feel 
confused and uncertain as to just how she should proceed. 
Apparently the best method would be to master, one at a 
time, the details of testing achievement in arithmetic. The 
teacher will do well to start with the tests which developed 
first historically, namely, the Courtis tests, series B. While 
these tests have defects which are admitted even by their 
author, yet they are better standardized than any other tests 
at the present time and so will best answer the purpose of 
measuring achievement in arithmetic, particularly for upper 
grade pupils. They are as serviceable as any other tests at the 
present time. When the teacher feels confident that she 
understands the Courtis tests, knows how to administer them, 
to grade them and to apply them to the remedial work in her 
own schoolroom, then she may properly take up the Woody 
scales for the fundamentals. These are quite well standardized. 
They are easily administered and they are valuable for diag- 
nostic purposes. Many teachers will not use other tests. 
Some however, will begin to see the advantage of standard 
tests and will desire to take up the Cleveland survey tests, 
applying them to their own schools, comparing their schools 
with others which have been measured by these tests, but 
particularly using them for locating the weaknesses of their 
own pupils. These tests were originally designed for diagnostic 
purposes and they seem well calculated to serve this purpose. 

In a year of earnest work, the teacher should master testing 
pupils in abstract numbers, should locate the shortcomings 
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of her class, and should make decided progress toward bringing 
the entire class to standard. Many teachers will prefer to 
wait until a second year to begin the use of the reasoning tests. 
This may be a wise procedure because their use should be 
followed by a thorough study of reasoning problems, the 
progressive steps which pupils must take in mastering such 
problems, and the business demands for such problems. A 
valuable study on reasoning problems, and the successive steps 
in their mastery, is found in the Connersville Course of Study 
in Elementary Mathematics. Since figuring which is done in 
actual business is always connected with situations requiring 
reasoning, the teacher should not fail to carry forward her 
work until she has mastered the details of administering, 
scoring, and interpreting reasonmg tests. 

The Next Step. — What is the next step in measurement in 
arithmetic? Some say it is to devise tests for the measure- 
ment of the higher processes in arithmetic. This may be so, 
but it is to be hoped that before such tests are formulated, 
the needs of common business practice will be more fully 
determined. If tests were now formulated for denominate 
numbers, percentage and its appUcations, mensuration, etc., 
they would almost surely represent merely textbook and 
schoolroom viewpoints. The results would doubtless be less 
satisfactory than the Kansas tests in decimals. It is to be 
hoped, therefore, that the more fundamental work of deter- 
mining the actual community and business demands of 
Arithmetic will be carried much further before any attempt 
is made to extend measurement in arithmetic to the higher 
processes. Progress is being made along this line ^ and in 
time we may hope to have a type of arithmetic throughout 

^ See particularly, Wilson, G. M., "The Social and Business Usage of Arith- 
metic," Teachers College Bureau of Publications; Mitchell, H. Edwin, "Some 
Social Demands in the Course of Study in Arithmetic," Teachers College 
Bureau of Publications; the ist, 2d, and 3d reports of the Committee on 
Minimum Essentials of the National Education Association, Public School 
Publishing Co. ; and the Iowa Elimination Reports. 
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the entire course, which is directly applicable to business usage 
and which is so taught as to further the inteUigent use of 
arithmetic in business. In the meantime, teachers are quite 
safe in furthering the work of measurement in arithmetic 
in the fundamental processes and simple fractions. Teachers 
may assume that mastery here is essential, and that measure- 
ment is valid so long as applied only to the formal aspects 
of the subject. 
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metic," Teachers College Bureau of Publications. 

12. Monroe, W. S., "An Experimental and Analytical Study of Woody^s 

Arithmetic Scales," Series ^j School and Society jVLj pp. 412-420, 
October 6, 191 7. 

13. Anderson, C. J., "The Use of the Woody Scale for Diagnostic Pur- 

poses," Elementary School Journal, June, 1918, pp. 770-781. 

in. Boston Research Tests in Fractions 
Copies not available for distribution. 

14. Ballou, Frank W., In bulletins of the Department of Educational 

Investigation and Measurement, of Boston. No. VII: "Arith- 
metic. Research Tests in Addition of Fractions"; No. XV: 
"Arithmetic. Achievement of Pupils in Common Fractions." 

IV. The Cleveland Survey Tests in Arithmetic 

Supplies may be secured by addressing Director Charles H. Judd, 
of the School of Education, Chicago University. 

15. Judd, Charles H., "Measuring the Work of the Public Schools." A 

volume in the Cleveland Survey Series. 

16. Counts, George S., "Arithmetic Tests and Studies in the Psychology 

of Arithmetic," Supplementary Educational Monograph, Whole 
Number IV, Chicago University Press. 

17. O'Hern, Joseph P., "Practical Application of Standard Tests," 

Elementary School Journalj May, 1918, pp. 662-679. 

18. Smith, J. H., "Individual Variation in Arithmetic," Elementary 

School Journalj November, 191 6, pp. 195-200. 

19. Heckert, J. W., "The Cleveland Survey Tests in Arithmetic in 

the Miami Valley," Elementary School Journal, Vol. 17, February, 
1918, p. 447. 

20. School Surveys : Cleveland, Grand Rapids, St. Louis. 
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V. Kansas Diagnostic Tests in Arithmetic 

Supplies may be secured from the Bureau of Educational Measure- 
ments and Standards, Emporia, Kansas. 

21. Monroe, W. S., "Ability to Place the Decimal Point," Etementary 

School Journal, 18: 287-293, December, 191 7. 

22. Wilson, G. M., "The Proper Content of a Standard Test," Eh- 

mentary School Journal^ 19 : 375-381, January, 1919. 

23. Monroe, W. S., "Diagnostic Tests in Arithmetic," Elementary School 

Journal, 19 : 585-607, April, 1919. 

VI. Stone Reasoning Tests 

24. Stone, C. W., "Arithmetical Abilities and Some Factors Determin- 

ing Them," Teachers College Contributions to Education, No. 
19, Columbia University, New York City. 

25. Stone, C. W., "Standardized Reasoning Tests in Arithmetic and 

How to Utilize Them," Teachers College Contributions to Edu- 
cation. No. 83. 

26. School Surveys: Butte; Salt Lake City; Nassau Coimty, New 

York. 

Vn. Better Selection of Subject Matter in Arithmetic 

27. McMurry, Frank M., "The Elimination of Useless Material," 

1904 Yearbook, National Education Association. 

28. Wilson, G. M., "Course of Study in Elementary Mathematics," 

Warrick and York, Publishers, Baltimore, Maryland. (In press.) 

29. Wilson, G. M., "A Survey of the Social and Business Usage of 

Arithmetic," Sixteenth Yearbook of National Society for the 
Study of Education, Part I, pp. 128-142. 

30. Wilson, G. M., "A Survey of the Social and Business Usage of 

Arithmetic," Dissertation, Teachers College Bureau of Publi- 
cations, Columbia University, New York City. 

31. First and Second Iowa Elimination Reports. 

32. Mitchell, H. Edwin, "Some Social Demands on the Course of 

Study in Arithmetic," Seventeenth Yearbook of the National 
Society for the Study of Education, Part I, pp. 7-17. 

33. Wise, Carl T., " A Survey of Arithmetic Problems Arising in Various 

Occupations," Elementary School Journal, 20: 118-136, October, 
1919. 



CHAPTER V 

THE MEASUREMENT OF READING 

" Most of the reading done by people after leaving school 
will be silent reading. I am of the opinion that much more 
stress should be laid upon it in the schoolroom. The Monroe 
Standardized Silent Reading Tests are a help to the teacher at 
the beginning of the school year in finding just where the 
pupils stand in reading. They also help her to discover the 
individual needs of each pupil. Tests given at the close 
of the term show the progress made." This statement was 
made by a teacher of a third grade after she had used the 
Monroe Standardized Silent Reading Tests for one half 
year to determine the ability and the progress of her children 
in reading. It reveals two or three important considerations 
for successful reading achievement. 

Teachers are beginning to realize the importance of placing 
more emphasis upon silent reading, since so much reading 
outside of the schoolroom is silent and since success in a voca- 
tion and happiness in leisure time depend upon the individual's 
ability to grasp the thought on the written page. The 
statement also reveals to the teacher the importance of 
knowing in a definite and objective manner what her class 
as a whole and what each individual in her class can do in 
reading. It is also necessary to know in definite and objective 
terms the amount of progress made by the class as a whole 
and by each individual student at the expiration of a certain 
period of time. 

The importance of silent reading cannot be overestimated. 
The ability to grasp quickly the thought in a paragraph, a 
chapter, or a book, is one of the biggest contributing factors 

no 
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to the success of any individual. The ability to master 
quickly the thought in modern literature, current events, 
history, etc., determines to a large extent the progress and 
development of the individual among his fellow men. It has 
much to do with his happiness during his whole life. If, 
therefore, reading plays such a significant part in life, is it 
not a matter of considerable importance that, in the training 
of the child to master the symbols with which ideas are ex- 
pressed, the best methods of instruction be used, his capacity 
be trained to the fullest extent with the least degree of waste, 
and his appreciation of the best that is said and written be 
developed? To this end the reading test as has been indicated 
is a most helpful instrument. 

Among the different factors that contribute to good reading, 
which is made up of the abiUty to interpret and remember, 
there are two which are of great unportance and which 
the teacher can determine in a definite manner. These 
factors are, first, the power of comprehending thought, and 
second, the rate of reading. These two factors are important 
in both kinds of reading, namely, silent and oral. 

Since it is not the purpose of this work to make an ex- 
haustive study of all the tests on anyone subject, a treatment 
is given of those reading tests only which, on account of their 
aim, simplicity of application, and extent of use are most 
serviceable in the hands of teachers. The following tests 
meet this purpose: Thorndike's Scale Alpha 2, Courtis's 
Silent Reading Test No. 2, Monroe's Standardized Silent 
Reading Tests, Haggerty's Visual Vocabulary Test, and 
Gray's Oral Reading Tests. 

Monroe's Standardized Silent Reading Tests 

Aim. — The aim of these tests is to determine the rate at 
which children read and the extent to which they are able 
to comprehend the thought in written discourse. This is 
accomplished by having them read silently a given sentence 
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or paragraph and then write answers to questions on the 
subject matter they have read. 

Description of Test. — Monroe's Standardized Silent Read- 
ing Tests have been constructed from sentences taken " from 
school readers and other books which children read." They 
are intended for grades three to twelve inclusive. They 
consist of three tests: 

Test one for Grades 3, 4, and 5. 
Test two for Grades 6, 7, and 8. 
Test three for Grades 9, 10, 11, and 12. 

Tests one and two have each three forms, forms i, 
2, and 3 ; test three has two forms, forms i and 2. Each 
form is of the same degree of difliculty, but has different 
subject matter so that the same class can be examined three 
times without using the same information. The first six 
paragraphs of test i, form i, which has in all sixteen para- 
graphs, are given in order to show the nature of the tests : 



Rate 
Value 8 



Rate 
Value 7 



No. I 

"I am not playing, little girl," said the 
squirrel. **I am running to my home in the 
hollow tree. Don't you hear my babies call- 
ing me? I must feed them." 

Where was the home of the squirrel ? 

In the 



No. 2 

The little Pilgrim girls carried their work 
boxes to the dame-schools and learned to sew 
and knit as well as to read and write. 

Where did the girls go with their work 
boxes? 

To the 



Compre- 
hension 
Value 1.3 



Compre- 
hension 
Value 1.3 
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Rate 
Value 7 



Rate 
Value 9 



Rate 
Value 6 



Rate 
Value 7 



No. 3 

When the white men first came to this 
country they found the red men, or Indians, 
living in wigwams, made of long poles and 
covered with skins. 

Which people Uved here first, the white or 
red? 

No. 4 

Hiawatha was a little Indian boy. He had 
no father and no mother. He lived with his 
grandmother, Nokomis. His home was in a 
wigwam. Draw a line under the word that 
tells whom Hiawatha lived with. 

Father, aimt, mother, uncle, sister, grand- 
mother. 

No. 5 

The cabin of Uncle Tom was a small log 
building close adjoining to " the house," as 
the negro designates his master's dweUing. 

Of what material was Uncle Tom's cabin 
built? 

No. 6 

A crab who lived in a sand-hill was sitting 
at his door in the sun eating a rice cake. An 
ape went by, carrying an orange seed. 

Where did the crab live? 



Compre- 
hension 
Value 1.3 



Compre-' 
hension 
Value 1.4 



Compre- 
hension 
Value 1.4 



Compre- 
hension 
Value 1.5 



Each test begins with a simple exercise and increases in 
comprehension value with almost each succeeding exercise. 
The measure of a child's ability to understand or comprehend 
what he reads in each exercise forms the comprehension 
value and is placed in the right-hand margin opposite each 
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exercise. The sum of the values of those exercises done 
correctly in five minutes forms his comprehension score. 

It is important not only to know the extent to which a child 
can grasp the thought in an exercise, but also the time in 
which it takes him to grasp this thought. The child who can 
grasp the thought in an exercise in three minutes has greater 
reading abiUty than the child who requires five minutes to 
grasp the thought in the same exercise. Each exercise is, 
therefore, given a rate value which represents the nimiber of 
words read per minute in careful reading. This value is 
placed on the left-hand margin opposite each exercise. The 
pupil's reading score is the sum of the rate values given to the 
different exercises which he reads in five minutes. 

Giving the Test. — The application of this reading test is 
very simple. Full instructions are given on the front page 
of each test. Each child is provided with a separate test. 
After he has written his name, age, grade, etc., he is given 
a preliminary test which explains how he is to proceed with the 
regular test. The value of the results will depend on the 
accuracy with which these simple instructions are observed. 

Scoring the Results. — A class record sheet is provided 
with each test. A copy of this record sheet with the results 
from a 3-A grade is given in Table 30. 

The score for the class of thirty-nine children reported 
in Table 30 is given in terms of a median instead of an average. 
This median score is the score on the middle paper in the 
group of papers that is being recorded. In March, thirty- 
nine children were tested. The comprehension score on the 
20th paper, counted from the paper receiving the lowest 
score, was 10.6. In a similar manner the median score for 
the rate of reading was determined. 

Interpreting and Using Results. — After a class has been 
tested and the results tabulated, the important work of inter- 
preting these results and applying them to particular needs 
still remains. Unless these two steps are developed to the 
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Table 30. — The Results from Monroe's Standardized Silent 
Reading Tests Given to a 3-A Grade in March 1918. 

CLASS RECORD SHEET i 

City D School B Grade 3-A 

Teacher N. E. Date March 22, 1918. 



Rate Score 



Interval 



160 


to 169 


ISO 


to 159 


140 


to 149 


130 


to 139 


120 


to 129 


no 


to 119 


100 


to 109 


90 


to 99 


80 


to 89 


70 


to 79 


60 


to 69 


so 


to S9 


40 


to 49 


30 


to 39 


20 


to 29 


10 


to 19 





to 



Number 
of Pupils 



Total 



Median 



I 
I 
I 

7 
5 

5 

I 

15 
2 

I 



39 



70 



COICPSEHENSION SCORE 



Interval 



70 & above 
67 to 68 
63 to 6s 
60 to 62 
57 to 59 
54 to 56 
51 to 53 
48 to 50 
45 to 47 
42 to 44 
39 to 41 
36 to 38 

33 to 35 
30 to 32 
27 to 29 
24 to 26 
21 to 23 
18 to 20 
15 to 17 
12 to 14 

9 to II 

6 to 8 

3 to 5 

O to 2 



Total 



Median 



Number 
of Pupils 



2 

I 

3 

I 

10 

9 
9 

3 

I 



39 



10.6 



Instructions for Making the Dis- 
tribution OF Pupils* Scores, and 
FOR Finding the Median Score 

1. The teacher must be careful 

that her papers are grouped 
correctly by classes. If she 
has but one grade of pupils, 
say sth grade, or but two 
divisions of one grade, say 
5th A and 5th B, then her 
papers are all grouped to- 
gether and but one " distribu- 
tion" made. If, however, 
she has parts of two or more 
grades, say part sth and part 
6th, she must make two or 
more piles of papers, one for 
each grade. 

2. Arrange the children's papers 

for any class group in order 
of the comprehension scores, 
the lowest score on top. 

3. To make the distribution called 

for, count the number of pa- 
pers whose scores fall within 
the successive groups listed. 
For instance, if the lowest 
score is 3, the next lowest 6, 
the next 7, 7, 11, and so on, 
you will put " I " in the group 
marked "3 tos"; "3"inthe 
group marked " 6 to 8 " ; " i " 
in the group marked "9 to 
II," and so on until the whole 
number of scores are record ed . 
The sum of these numbers 
must equal the number of 
children taking the test. 

4. The median score is the score on 

the middle paper in the pile 
of papers arranged accorcUng 
to size of scores. If there are 
35 papers, the median score is 
the score on the i8th paper. 
If there are 36 papers, the 
median score is halfway be- 
tween the score on the i8th 
paper and the score on the 
19th paper. 

5. Repeat 2, 3, and 4, for the rate 

scores. 
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fullest extent the time and the energy of teacher and pupils 
are not justified. The extent to which the teacher has been 
trained or trains herself to complete the testing operation will 
determine her degree of success or failure in the use of measure- 
ments in her classroom work. 

One of the first things the teacher considers after she has 
attained the final scores for her class is the relation which they 
bear to any established standards for the test which she has 
given. 

The standards {middle of year) for the Monroe Standardized 
Silent Reading Tests are as follows : 



G«*UE 


m 


IV 


V 


V 


vu 


VIU 


IX 


X 


XI 


xu 


Rate . . . 
Comprehension 


52 
7-2 


73 
13 


8q 
19 


88 


gg 

23 


io6 

36.4 


S7 

25 


8i 

15 


88 
a6.4 


89 

27-2 



The scores for the 3-A class reported in Table 30 are, rate 
70 and comprehension 10.6, It will be seen, therefore, that 
the class is above the standard in both comprehension and 
rate. 

In the second place, the large spread of ability in rate and 
comprehension called for intimate knowledge with, and in- 
dividual instruction to suit, these abilities. 

By referring to the class record sheet, it will be seen that 
in rate of reading at least 21 pupils surpassed the standard 
(52) for the third grade, 15 pupils scored above the standard 
(73) for the fourth grade, and 10 pupils above the standard 
(89) for the fifth grade. In comprehension at least 26 pupils 
scored above the standard (7.2) for the third grade, at least 
7 pupils scored above the standard (13.0) for the fourth grade, 
and at least 3 pupils above the standard (19.0) Mr the fifth 
grade. 

A clearer picture of this situation may be secured by 
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putting these results in the form of a graph as indicated in 
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Another way of seeing clearly the wide spread of ability 
in this class is by a comparison of the highest and lowest 
scores. The rate scores range from 31 words to 129 words 
per minute, and the comprehension scores from i to 26. 

Remedial Measures. — The first problem for the teacher 
who gave the test reported in Table 30 was to reclassify her 
grade. It would be exceedingly unfair to keep the 7 pupils 
(using comprehension as a basis) who score above the fourth 
grade standard continuing in third grade reading. These 7 
pupils should, therefore, either be promoted to a fourth grade 
(in a short time some of them may be advanced to a fifth 
grade) or given fourth grade reading work under the same 
teacher. 

A second problem is emphasis on thought getting in silent 
reading. From the low scores in both rate and comprehension 
when compared with the highest scores, it is quite clear that 
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there are a good many pupils who do not grasp quickly the 
thought in what they read. The teacher who gave the test 
reports as follows : "In most cases those who received high 
rate scores also received high comprehension scores, tending 
to show that the rapid reader usually understands better 
what he reads. I, therefore, planned to give more time to 
silent reading. The pupils were questioned and marked on 
the number of questions which they were able to answer 
correctly." This plan helped them to understand what they 
read and made the lesson interesting." 

In addition to the above plans, the teacher found the follow- 
ing devices helpful : 

1. Practice was given in the selection and planning of the 
most interesting part of a story for dramatization. 

2. Directions for reading were given by indicating thought 
units rather than paragraphs or hnes. For example the 
teacher says, " William, read from the place where it tells us 
he (the character) decides to leave home," rather than 
" Read the next." 

3* The pupils were given an opportunity to work up in- 
dividual assignments. 

4. The pupils were trained to outhne the main points or 
events in the lesson. 

5. Considerable opportunity was given for the pupils to 
question one another on the content without quibbling or 
asking silly questions. 

6. Other examinations with the same tests were made to 
show progress. 

No one can question the value of such a procedure in which 
the teacher gives a diagnosis of a teaching problem, and 
applies corrective measures in the form of intelligent and 
systematic practice. The plan is simple so that what this 
teacher has done any teacher can do. Surely the gratification 
of knowing without a doubt whether or not success is attending 
one's effort is worth while. 
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Silent Reading 

Thomdike^s Scale Alpha 2, for Measuring the Understanding 

of Sentences 

Aim. — The purpose of Thomdike's Scale Alpha 2 is to 
measure the student's ability to comprehend the meaning 
of sentences and paragraphs. The rate of reading is not 
measured. 

Description of Tests. — The test is divided into two parts. 
Part I may be used in grades 3 to 5 ; Part II in grades 6 to 12 
inclusive. The test is made up of a series of paragraphs with 
questions on the content of each paragraph. Each para- 
graph increases in difficulty over the preceding one with 
equal steps. Below are given the four sets of paragraphs in 
Part I to show the nature of the test : 

Set I. Difficulty 4 (Approximately) 

Read this and then write the answers. Read it again if you need to. 

John had two brothers who were both tall. Their names were 
Will and Fred. John's sister, who was short, was named Mary. 
John liked Fred better than either of the others. All of these 
children except Will had red hair. He had brown hair. 

1. Was John's sister tall or short? 

2. How many brothers had John ? 

3. What was his sister's name? 

Set II. Difficulty 5.25 

Read this and then write the answers. Read it again if you need to. 
Long after the sun had set, Tom was still waiting for Jim and 
Dick to come. "If they do not come before nine o'clock," he 
said to himself, "I will go on to Boston alone." At half past 
eight they came, bringing two other boys with them. Tom was 
very glad to see them and gave each of them one of the apples he 
had kept. They ate these and he ate one too. Then all went on 
down the road. 



I20 How to Measure 

1. When did Jim and Dick come ? 

2. What did they do after eating the apples? 

3. Who else came beside Jim and Dick? 

4. How long did Tom say he would wait for them? 

Set III. Difficulty 6 

Read this and then write the answers. Read it again if you need to. 
It may seem at first thought that every boy and girl who goes 
to school ought to do all the work that the teacher wishes done. 
But sometimes other duties prevent even the best boy or girl from 
doing so. If a boy's or girl's father died and he had to work 
afternoons and evenings to earn money to help his mother, such 
might be the case. A good girl might let her lessons go undone 
in order to help her mother by taking care of the baby. 

1. What are some conditions that might make even the best boy 
leave school work unfinished ? 

2. What might a boy do in the evenings to help his family?. . . . 

3. How could a girl be of use to her mother? 

4. Look at these words : idle, tribe, inch, it, ice, ivy, tide, true, 
tip, top, tit, tat, toe. Cross out every one of them that has an i 
and has not any / {T) in it. 

Read this and then write the answers to 5, 6, and 7. Read it again 
if you need to. 

Nearly fifteen thousand of the city's workers joined in the 
parade on September seventh, and passed before two hundred 
thousand cheering spectators. There were workers of both sexes 
in the parade, though the men far outnumbered the women. 

5. What is said about the number of persons who marched 
in the parade ? 

6. What did the people who looked at the parade do when it 
passed by? 

7. How many people saw the parade ? 
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Set IV. Difficulty 7 

Read this and then write the answers to i, 2, 3, and 4. Read it 
again if you need to. 

You need a coal range in winter for kitchen warmth and for 
continuous hot-water supply, but in summer when you want a 
cool kitchen and less hot water, a gas range is better. The XYZ 
ovens are safe. In the end-ovens there is art extra set of burners 
for broiling. 

1. What effect has the use of a gas range instead of a coal range 
upon the temperature of the kitchen ? 

2. For what purpose is the extra set of burners? 

3. In what part of the stove are they situated? 

4. During what season of the year is a gas range preferable ? . . . 



Read this and then write the answers to ^^ 6, and 7. Read it again 
if you need to. 

Hay fever is a very painful, though not a dangerous, disease. 
It is like a very severe cold in the head, except that it lasts much 
longer. The nose runs; the eyes are sore; the person sneezes; 
he feels imable to think or work. Sometimes he has great diffi- 
culty in breathing. Hay fever is not caused by hay, but by the 
pollen from certain weeds and flowers. Only a small number of 
people get this disease, perhaps one person in fifty. Most of those 
who do get it can avoid it by going to live in certain places during 
the summer and fall. Almost every one can find some place where 
he does not suffer from hay fever. 

5. What is the cause of hay fever ? 

6. How large a percentage of people get hay fever ? 

7. During what seasons of the year would a person have the 
disease described in the paragraph? 

Giving the Test. — The fact that the test can be easily 
given to large groups of children at once makes it a useful 
instrument in the hands of the teacher. Before the children 
are asked to take the actual test, a preliminary test with 
instructions and subject matter very similar to the regular 
test is given to make sure that every child knows exactly 
what he is to do. 
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A test sheet is then placed in the hands of each child. 
After he has written his name and age, he is asked to follow 
instructions which are at the head of each paragraph. He is 
given as much time as is needed to do all he can. 

Recording and Determining Scores. — The correct answers 
to the different questions under each paragraph have been 
carefully determined and arranged on an answer sheet which 
should be in the hands of each teacher. A portion of this 
answer sheet is given below : 

Key for Scale Alpha 2 

Difficulty 4, Element I. "Short." 2. "Two." 3. "Mary." 

Difficulty 5J, Element i. "Half past eight," "eight thirty," 

"8 : 30 " or equivalent. 2. "Went down the road," or equivalent 

(call "went on," or "went on to Boston," wrong). 3. "Two 

other boys," or "two boys." 4. "Nine o'clock." 

Difficulty 6, Element i. Right responses are such as: "If he 
has to work afternoons and evenings to help his mother." "When 
their parents died." "When the father dies." "If his father died 
and to work." "If his father died or if sick." "His father 
might die." " His father may died." " If his father died he has to 
work." "If his father or mother died." 

Etc. 

This answer sheet in the hands of each teacher reduces the 
amount of error due to personal opinion. The rating, there- 
fore, becomes accurate to a remarkable degree. The answer 
to each question is marked right or wrong according to the 
answer on the answer sheet. 

After the marking of the papers is completed these markings 
are transferred to a class record sheet. This sheet has on 
it the number of each question under each set. It calls for 
the name of each child and his scores opposite under each set. 

Table 31 shows a copy of a record sheet from a 5-B grade 
in a city school system in which the Thorndike Scale Alpha 
2 was given. 
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Table 31. — Record Sheet for Thcsindike Reading Scale, Alpha a 

Grade B — 5 Teacher's Name Mrs. C. 

Date of Test March 1919. 



J.K. . . 

D. F. . . 

V. S. . . 

J.T. . . 

D. C. . . 
C. McL . 
P. A. . . 
H. B. . . 
P. R. . . 
H. E. . . 
A. D. . . 

E. C. . . 
C. P. . . 
P.B. . . 
LB. Jr. . 
W. C. . . 
S. F. . . 
G. S.' . . 
J. McL . 
G. M. . . 
G. S." . . 
G. B. . . 
Total number wrong 
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If a child's answer to a question is correct the space opposite 
his name for this question is left vacant. If the answer is 
wrong a zero representing an error is recorded. The total 
number of errors which each child makes on each line is then 
determined. The value of the line on which he makes errors 
not to exceed 20% is his score. The total nimiber of errors 
for the entire class is determined for each line. The value of 
the hne which gives 20% of error is the score for the class. 
If no Une gives exactly 20% of error the value of the line which 
gives nearest to 20% is used. 

On the preceding record sheet, the class made 10.4% of 
errors on the line under Set II and 38.8% of errors on the 
line under Set III. The line which gave nearest to 20% of 
errors is Set II, Difficulty 5.25. By referring to a table of 
errors (index) it will be seen that when a line gives 10.4% 
of errors, .62 is to be added to the value of this Une to secure 
the score. The score for this class is, therefore, 5.25, the 
value of Set II, plus .62, or 5.87. 

Interpreting the Results. — This test is of great assistance 
to the teachers on account of its diagnostic value. Teaching 
the child to get the thought in what he reads is one of her 
most difficult and yet most important tasks. With the aid 
of this test the teacher can determine accurately his ability to 
comprehend the thought in a sentence or paragraph in con- 
junction with the class as a whole or with each member of 
the class. 

The class record sheet given above reveals the fact that 
the ability of the different children to understand the thought 
in these paragraphs varies widely. For example M. H. and 
G. S.^ answered each question correctly in Sets II and III. 
On the contrary A. S. answers three questions out of four 
correctly in Set II and only two out of seven in Set III. 
D. C. answers all the questions in Set II correctly but only 
one out of seven in Set III. Every teacher should keep before 
her the class record sheet and also each child's test sheet. 
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From these sheets she can tell the kind of instruction each 
child should receive ; they will likewise tell her if he should be 
advanced to another grade. In fact, since thought getting is 
such an important factor in practically all subjects, the teacher 
will have considerable information about each child's general 
progress. 

The following table shows the grade scores in a particular 
school in comparison with the Indiana medians : 



Table 32 



»„.. 


S-6 


3-A 


4-fl 


4-A 


S-B 


5~A 


«^B 


«-A 


7-B 

6.33 


j-A 


F 


3-A 


Class scores . . 


S-I2 


5.48 


S.48 


5.97 
6-56 


S.87 


S-S8 
7-56 


!! 


6.66 
8.46 


7.148.C39 
8.73 


7.63 
9- 



The above comparison reveals the fact that the reading 
ability in the school was below the median scores in the 
Indiana cities. 

The general conclusion of the teachers in this building was, 
therefore, that the children were not able to interpret the 
thought in a written sentence or paragraph. 

Using the Results. — The value of this test will be found 
in the direct application of the results to classroom practice. 
Practically every recitation affords such an opportunity. 
Getting the thought in what has been said or read is the big 
problem in all teaching. The teachers of the school in which 
the results in Table 32 were secured report as follows : " We 
stressed thought getting in all subjects such as geography, 
history, arithmetic, and reading. The assignments included 
questions calling for thought getting. The children were also 
asked to prepare questions calling for the thought in a certain 
paragraph or chapter which the members of the class were 
asked to answer." 
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One of the teachers in this school points out the following 
advantage of this test : " Another valuable service which 
these tests rendered was in determining promotions. In the 
June promotions the mother of a little girl who took the test 
questioned her child's promotion mark. When the mother 
was shown the rate which her child received on the reading 
test in comparison with the rating of other children and the 
answers to her questions on the different paragraphs, she 
was entirely reconciled to the justice of the teacher's marks. 
The mother also learned facts about her child which were 
unknown to her before. The child's answers to the ques- 
tions emphasized the fact that she frequently had flights 
of fancy which carried her far from the point in question." 

The teacher of a 4-A grade in which the test was given in 
March and June 191 8 reported that " a class score of 5.55 
was obtained in March and a class score of 6.1 1 in June. 
Some of the weaker pupils improved from 4 to 5.25 and in one 
case from 4 to 6." The teacher further suggests that, " If 
these tests were given during the first week of the temi, the 
teacher would know in a general way how the class ranks and 
would also know the ability of each individual pupil in the 
understanding of sentences." 

The Thorndike Scale Alpha 2 for Measuring the Under- 
standing of Sentences can easily be given by the teacher. To 
the child it means nothing more than a class exercise in which 
instructions are carefully given. It measures one of the most 
important results of a teacher's efforts. For this reason it is 
strongly recommended to teachers in the teaching of reading. 
It is one of the best silent reading tests available. 

Courtis Silent Reading Test No. 2 

Aim. — The aim of the Courtis Silent Reading Test No. 2 
is to measure the rate and the amount of comprehension in 
silent reading. 
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Description of Test. — This test is published in two editions, 
Form I, in which appears a story entitled " The Kitten Who 
Played May Queen," and Form II, " The Kitten Who Went 
to a Picnic." Each story is of equal difficulty. The test is 
suitable for Grades 2 to 6 inclusive. Each test is divided into 
Part I and II. Part I measures the rate of reading. Part 
II measures the comprehension of reading. 

Giving the Test. — Instructions to the pupils are printed 
on each test. These instructions are simple and easily under- 
stood by the children. Each child is permitted to read for 
three minutes, at the expiration of which time a line is drawn 
around the last word that is read. This will determine the 
rate of each child's reading. The child is then given five 
minutes in which to answer questions on what he has read. 
The number of questions answered correctly will determine 
the extent of his comprehension. 

Scoring the Results and Computing Class Scores. — The 
author has provided careful instructions for the giving and 
scoring of the tests and the recording of the results in Folders 
B and D, series R, which should be in the hands of each 
teacher. The children in grades 5 and 6 and usually grade 4 
can score one another's papers. The papers of children 
in grades 2, 3, and backward 4th grade classes can be scored 
and recorded by the children of higher grades. Answer 
cards are provided for this purpose. 

After the median rate of reading and the median number 
of questions correctly answered are determined, the index of 
comprehension, which is " the relation the difference between 
the right and wrong answers bears to the right answers, " is 
estimated. The method of determining this index of com- 
prehension is thoroughly outlined on pages 2 to 5, Folder D. 
Each child's scores are then tabulated on a class record 
sheet. Such a record sheet for a 6-B grade in a city 
school system which was tested in January, 1919, is shown 
on pages 128 and 129. 
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Index of Comprehension 

Class Recokd Sheet (Couxtis) 

Front 



City 




School 






Teacher 


MissH. 








Room 




Grade 6-B. 


Date 1-29-19 




Method 





















Table i 
Rate of Reading 




Table 2 
Standards 




Number 

of 

Children 

Making 

Each 

Score 




Words per Minuti!. 


Score in 
Words 

Minute 


Grade 


S 


8 


4 


6 


6 


7 


8 


























Over 


Starch 


108 


126 


144 


168 


192 


216 


*^ Mf\ 


400 






240 


Gray 


90 


138 


180 


204 


216 


228 




380 






240 


360 






This Test 


84 


"3 


145 


168 


191 














340 


My City 
















320 








My Class 








186 






300 








280 










260 






Questions in Five Minutes 


240 


S 




Grade 


S 


8 


4 


6 


6 


7 


8 


220 


4 




200 


2 




Standard 


16 


24 


30 


37 


40 






180 


4 




My City 
















160 


I 






My Class 










28 






140 


2 






120 


4 








100 


3 




Index of Comprehension 


80 






Grade 


S 


8 


4 


6 


6 


7 


8 


60 










Standard 


59 


78 


89 


93 


95 






40 








20 






My City 































My Class 

t 








89 






Total 


25 








j 




Median 


186 
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Index or Compsehension 

Class Record Sheet (Couktis) 

Back 



„„„». 






";EE^" 


Stawsred 


Total 


-a 


-M 


•-W 


^ 


Tl>-T» 


^ 


tui 


|»* 


,„. 


100 


70 
























65 






















,3 


60 






















S 


S5 






















8 


50 


I 




















1 i 


45 
























40 


5 












1 


111 




1 




35 


3 














1 


1 




1 


30 


i 








1 






1 






1 § 


25 


9 










1 




1 


11 


II 


111? 


,20 


4 












11 


II 






I» 


- 




















1 
























:' 


5 






















s 

























S. 


Total 


S5 








J 


I 


3 


8 


3 


3 


6 3 



























Median Number of Last Question Answered 2 
Median Index of Comprehension 8g. 
Total Number Taking Test 25. 
Number Marked I. N. F. — . 
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Interpreting and Using Results. — The preceding record is 
read as follows : The class is slightly below the standard for 
the grade in the rate of reading but considerably below the 
standards for the grade in number of questions answered and 
in the comprehension of thought. The rate of reading ranges 
from IOC words to 240 words in 3 minutes, a range which is 
entirely too great and which indicates a large nmnber of slow 
readers and likewise a large number of poor readers. 

From this same record sheet (back) the scores are classified 
according to the number of questions answered and the index 
of comprehension. An analysis of this classification shows 
only two children with satisfactory rate and comprehension 
and an additional four children with satisfactory rate and 
poor comprehension, making a total of six children out of 
twenty-five with a satisfactory rate. The record shows also 
seven children with satisfactory comprehension and poor 
rate, and an additional twelve children with poor rate and 
poor comprehension, making a total of nineteen children un- 
satisfactory in the rate of reading. 

The standard scores for the Courtis Silent Reading Test 
No. 2 are as follows : 



Grade 



Words per minute . . . 
Questions in five minutes . 
Index of comprehension . 



II 


m 


IV 


V 


84 


113 


145 


168 


16 


24 


30 


37 


59 


78 


89 


93 



VI 



191 

40 

95 



In order that all of the teachers in the above school could 
profit from the results of these tests, the following table, which 
compares the scores of each class with the respective grade 
standards, was prepared by the principal of the school and ex- 
plained in detail to the teachers in conferences called for this 
purpose. 



The Measurement of Reading 



131 



Table 33 



Gradb 


No. OP 
Children 


No. 07 Words 

Read per 

Minute 


No. OF Questions 

Answered in 

5 Minutes 


Index of 
Comprehension 


4"B . . 


6l 


108 


25 


69 


4-A . . 


27 


149 


24 


84 


Standard 




145 


30 


89 


5"B . . 


32 


137 


24 


86 


5-B . . 


26 


171 


28 


88 


5-A . . 


17 


122 


28 


94 


5-A . . 


12 


157 


32 


84 


Standard 


• 


168 


37 


93 


6-B . . 


25 


186 


28 


89 


6-B . . 


29 


169 


33.5 


92 


6-A . . 


18 


154 


35.5 


92 


Standard 




191 


40 


95 



This table is read as follows : In the 4-B grade there were 
61 pupils. In this class as many pupils made a score above 
108 words per minute and 25 questions answered in 5 minutes 
as below these scores respectively. The index of com- 
prehension is 69. This grade is therefore below the standard 
in all three scores. 

If the class scores are compared with class standards it 
will be seen that the comprehension scores with the exception 
of the 4-B grade (Index of comprehension 69 Standard 89) 
are more regular and nearer the class standards than the class 
scores in the other two points. The teachers of this school 
report : " The school as a whole measured more nearly to 
standard in comprehension than in the other two items. 
It made the poorest record in the number of questions answered 
in five minutes. Not a single grade reached the standard. 
Three fifth grade classes and one sixth grade fell below the 
standard for the fourth grade." 



132 



How to Measure 



An analysis of the individual papers made by the teachers 
is presented in the following summary : 



At or above standard in all respects . . . 
At or above standard in speed of reading . . 
At or above standard in number of questions 

answered 

At or above standard in index of comprehension 
At or above standard in no respect .... 




Percentage 



3.6 
26.4 

22.8 

30-1 
43.5 



The outstanding fact in this summary is the 3.6% of children 
scoring above and 43.5% of the children scoring below the 
standards for all three points. 

A further analysis of the individual papers shows that 
" four pupils in the s-A grade and four in the 4-B grade 
showed negative indices of comprehension. They wrote 
more incorrect than correct answers to the questions. They 
not only failed to get the correct meaning from what they 
read but obtained one absolutely opposed to the correct 
meaning. One of these pupils has been tested by the psy- 
chological examiner and found to be a normal child." 

Remedial Measures. — The report from the teachers 
continues : " The poor showing in the number of questions 
answered and the almost uniformly better showing in com- 
prehension indicates a desire for accuracy even at the expense 
of speed. Probably it was necessary for the children to read 
each paragraph several times in order to answer the questions 
correctly.'' 

" It would seem that some time exercises in silent reading 
followed by a test for accuracy of comprehension might help 
to overcome this deficiency and I have recommended such 
exercises to the teachers to be used in the place of their 
regular reading lesson. Emphasis is being placed upon the 
time element." 
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In carrying out the above plan the following devices were 
found helpful: 

1. " Expose a paragraph of reasonable difficulty on the 
board for a few seconds. Have the children read silently and 
then have them answer questions revealing their compre- 
hension of the content of the paragraph. 

2. Expose another paragraph of directions on the black- 
board for a few seconds and permit the children to follow 
them out as soon as they get the thought. 

3. Expose a word picture briefly and permit the children 
to draw what they get from the brief exposure. Emphasize 
the time factor in order to show children the relative rates of 
reading. 

4. Let a child begin to read at any point in the story. 
The child who discovers the place first will continue the 
reading.'' 

Oral Reading 

School practice has in the past given an undue amount of 
attention to the training of children in oral reading. It is 
not unfair to say that a much larger proportion of the school 
time allotted to reading is given to oral reading than to silent 
reading and yet, when the child becomes an adult, he will have 
little opportunity to practice it. Oral reading to the adult is 
the incidental means of expression. It is the person gifted 
in expression who makes a great use of oral reading. Oral 
reading should receive proper attention in the classroom not 
only as a means to successful silent reading, but also for the 
use that will be made of it by all individuals occasionally 
and a few individuals widely. It should not, however, be 
taught at the sacrifice of silent reading. 

Gray's Oral Reading Test 

Aim. — The aim of Gray's Oral Reading Test is to determine 
accurately the extent of the child's mastery over the mechanics 
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of reading. This is shown by the rate of his reading and the 
accuracy with which he reads. The rate is determined by the 
number of seconds it takes to read a given paragraph. The 
accuracy is determined by the number of errors made in 
reading a paragraph. Six kinds of errors are noted, namely, 
complete mispronunciation, partial mispronunciation, — 
omissions, substitutions, insertions, and repetitions. 

Description of Test. — The test consists of twelve para- 
graphs intended for grades one to eight inclusive. Each 
paragraph increases in difficulty over the preceding one by 
equal steps which have been scientifically determined. 

Giving the Test. — Complete instructions for giving the 
tests are found on the back of the score sheet, which must be 
in the hands of each teacher using the test. These instructions 
should be rigidly followed. No teacher should attempt to 
examine her class before she has completely mastered the 
instructions for giving the tests and scoring the results, and 
until she has had some practice through the examination of 
two or three children. The tests can be given to only one 
child at a time and then not in the presence of the other 
children. There should be no interruptions. For this reason 
the test takes a much longer time for its appUcation than is 
required for most tests. One teacher reports that it took 
her three hours and forty-five minutes to test a class of 
twenty-five children. This time was distributed over a 
number of days. The test was given after school, at noon 
periods and during the regular school hours in another room 
where there could be no interruptions. In the selection of an 
appropriate time for giving tests care should be taken to see 
that normal working conditions for the child prevail. This 
same teacher also reports that : " The children loved this 
test. I have never seen them any happier than when they were 
reading it for me. They liked the easy paragraphs because 
they were easy and they thought it was great fun to try to 
pronounce the difficult words in the more difficult paragraphs." 
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From the preceding quotation it is evident that the success 
with which the tests are used by a teacher depends upon the 
spirit with which she approaches her work and the accuracy 
with which she follows instructions. As the child reads 
from one copy of the test, the teacher follows on another 
copy and marks the errors as indicated on the author's instruc- 
tion sheet. 

Scoring Results. — The instructions for scoring the results 
are simple. The time taken to read each paragraph can be 
recorded in seconds on the left-hand margin of the test sheet ; 
the number of errors made in reading each paragraph can be 
recorded on the right-hand margin of the test sheet. The 
score for each paragraph is determined from the number of 
seconds and the number of errors according to the following 
key provided by the author on the score sheet : 



Seconds 


Euou 







1 


1 


a 


4 


■ 


e 


T 


Or More 


40 or more .... 






S 


2 












30-39 






i 


2 




I 


I 







25-19 






.1 


2 




1 


I 







20-24 






^ 


.=1 




1 


1 







ig or less 






4 


3 




' 


' 


° 





"The numbers in the left-hand column refer to the number of seconds 
required to read a paragraph. The numbers io the horizontal line at the 
top of the table refer to the number of errors made in reading. The 
numbers in the horizontal line to the right of 40 mean that if a paragraph 
is read in 40 or more seconds with no errors a credit of 4 is given ; with i 
error, a credit of 4 ; with 2 errors, a credit of 3 ; with 3 errors a credit of 
2 ; etc" 

The following is an actual reproduction of a child's mis- 
takes in reading paragraph IV and the teacher's scorings : 
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IV 
Once there lived a king and queen in a 
large palace. But the king and queen were not 

was 

happy. There were no little children in the 

house or garden. One day they found a ^ errors 

94 poor little boy and girl at their door. They (score is 

seconds took them into the beautiful palace and ®^ 

made them their own. The king and queen 
were then happy. 

It took him 94 seconds to read this paragraph and he made 
five errors. By referring to the key, it will be seen that the 
score for reading a paragraph in 40 or more seconds with five 
errors is zero. Therefore, his score on paragraph IV is zero. 
His record on all of the paragraphs is as follows : 

Paragraph I 36" errors 5 score i 

II 48" "4 " I 

III 86" "3 "2 

IV 94" "5 " o 

V 180" "7 " o 

A record of 86 seconds with 3 errors entitled him according 
to the key to a score of 2 for paragraph III. The record of 
48 seconds with 4 errors entitled him to a score of i for para- 
graph II. The record of 36 seconds with 5 errors entitled 
him to a score of i for paragraph I. The teacher then entered 
his score for each paragraph opposite his name on the score 
sheet. 

Below is given the exact record and also pupil score for 
each pupil in a class of 25 children in a 2-A grade of a city 
school system which was tested with Gray's Oral Reading 
Test. 
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Omi. Be«ddjc Recokds 


Pdpil 






P«*.«, 


™ 


N^e 


s« 


Age 


Nstioiidily 








t 


6 


' 


B 


PupiiSmre 


I, M,J. . . 


F. 


7 


Swedish 








4 


4 


4 


J 


67^ 


2. A.N. . . 


1'. 




Swedish 








4 


4 


2 




62J 


3. C. Mc. . . 


F. 


7 


Irish 








4 


4 


1 




6ii 


4. E.N. . . 


F. 


Q 


Swedish 








3 


4 


2 




6ii 


5, M. W. . . 


F. 


8 


Swedish 








4 


4 






60 


6. C.J. . . 


F. 


6 


Norwegian 








4 


3 


1 




60 


7. A. P. . . 


F. 


8 


Swedish 








,1 


4 






58I 


8. A. R. . . 


F. 


7 


Norwegian 








4 


4 






58} 


g. E. H. . . 


F. 


7 


Norwegian 








4 


1 






57i 


10. M. K. . . 


F. 


S 


Polish 








1 


4 






S7i 


11. M.Mc. . 


F. 


7 


Irish 








2 


3 






SS 


12. E. H. . . 


M. 


7 


Swedish 








2 


; 






55 


13. E. 0. . . 


F. 


8 


Norwegian 








3 








53I 


14. P. M. . . 


F. 


8 


American 








3 








Sii 


15. R,T. . , 


M. 


7 


Polish 




A. i 






1 






-sii 


16. H. H. . . 


F. 


7 


Swedish 


















48f 


.7. E. P. 


F. 


7 


Swedish 


















46i 


IB, K. K. . . 


M, 


S 


Polish 


















4ii 


19. IL H. . . 


M. 


8 


Norwegian 


















3ii 


30. L, H. . . 


M. 


7 


Norwegian 










I 


I 






^61 


2.. L.T. . . 


M. 


7 


Polish 




















22. CO. . . 


F. 


7 


Norwegian 


















i6i 


as. J. K. . . 


M. 


7 


Polish 


















i3i 


24. 0. R. . . 


M. 


8 


Assyrian 


















.3i 


25. M. M. . . 


F, 


lo 


Polish 














10 




1^1 


Total scores . 








83 


S^ 


76 


7' 


48 


44 


~ 




Average class sixi 








. . 45-8 



In this table the initials of the children are given together 
with the sex, age, and nationality. It is read as follows: 
M. J., girl, seven years old of Swedish descent made a score 
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of four on paragraphs one to seven inclusive and a score of 
two on paragraph 8 ; A. N., a girl, seven years old, of Swedish 
descent made a score of four on paragraphs one to six inclusive 
and a score of two on paragraph seven, etc. 

After each individuaPs paragraph score is determined the 
pupiFs score for the test is found as follows : Multiply the 
score on paragraph I by 55 if in grade i, 35 if in grade 2, 
30 if in grade 3, 25 if in grade 4, 20 if in grade 5, 15 if in grade 
6, 10 if in grade 7, 5 if in grade 8. Multiply the scores on 
each of the other paragraphs by 5. The sum of these prod- 
ucts divided by four gives the score on the test. 

Since M. J. was in the 2--A grade, her score for paragraph 
I is multiplied by 35 and each of her scores on the other para- 
graphs by 5. The sum of these products divided by 4 will give 
67.5 according to the following process : 



Paragraph 



I 

II 

Ill 

IV 

V 

VI 

VII 

VIII 

Total product 



Score 



4 
4 
4 
4 
4 
4 

4 

2 



Value 



35 
5 
5 
5 
5 
5 
5 
5 



Product 



140 
20 
20 
20 
20 
20 
20 
10 



270 



Process : Total product 270 divided by 4 equals 67.5 — score for M. J. 



The average class score is found in the same manner with 
the exception that the sum of the individual paragraph scores 
is used instead of the individual paragraph scores. Referring 
to Table 34 it will be seen that the sum of the scores on para- 
graph I for this class of 25 children is 83, for paragraph II 
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84, for paragraph III 76, etc. Since the pupils who made 
scores are in the 2-A grade, the sum of their scores for para- 
graph I is multipKed by 35 and the sums for the scores for the 
other paragraphs are each multiplied by 5 according to the 
following process : 



Pa&agraph 



I 

n 

m 

IV 

V 

VI 

-VII 

VIII 

Total product 



Score 



^2> 
84 
76 

71 
48 

44 
10 



Value 



35 
5 
5 
5 
S 
5 
5 
S 



Product 



2905 
420 
380 

355 
240 

220 

50 
10 



4580 



The average class score is secured by the following process : 
4580 divided by 100 (No. of children X 4) equals 45.8, average 
class score. 

Interpreting Results. — The value of a test of any kind 
lies in the use that is made of it. Consequently, it is of the 
greatest importance not only that the test be given accurately 
but also that the results be used widely. To this end it is 
necessary that teachers be able to interpret their results and 
know how to use them. 

One of the first problems for the teacher is to compare 
the results of her class with scores from classes in the same or 
other cities. Gray's Oral Reading Scale has been used in a 
number of surveys which have given considerable data for 
comparative purposes. The following table ^ gives the teacher 
an opportunity to make such comparisons : 



1 Courtis, The Gary Survey, "The Measurement of Classroom Products," 
p. 272. 
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Table 35 



G„.. 


I 


n 


UI 


!V 


V 


VI 


VU 


VIII 


Gary, actual averages . . . 




37 


16 


.10 


.!<» 


41 


42 


41 


23 Illinois cities 




io 


27 


40 


44 


4=5 


47 




Cleveland 




43 


46 


■17 


4S 


40 


47 


48 


Grand Rapids 




44 


47 


40 


SO 


4« 


4« 


48 


St, Louis 




47 


■io 


=^2 


^i 


St 


^i 


■^I 


Gray's Standard 


^i 


43 


40 


47 


48 


49 


47 


48 



This table is read as follows : On Gray's Oral Reading Test, 
the second grade pupils in Gary made an average score of 27 ; 
in 23 Illinois cities, 20 ; in Cleveland, 42 ; etc. 

Since the average score of the second grade class reported 
in Table 34 was 45.8, this class scored higher than Gray''s 
Standard and every city score except St. Louis. Such 
comparisons give the teacher information which enables her 
to base her practice on scientific facts rather than on opinion. 

The teacher of this class says in this connection : " I expected 
it to come out that way because I think this class as a whole 
is doing good work in oral reading. Some of the children 
are very unusual readers, and there are not so many poor 
readers." The test in this class also reveals the fact that 
there are a few children who are very poor oral readers and 
the extent to which they are below the average for the class. 
It, therefore, becomes a means of dividing a class of students 
with reference to their ability. It is here that the test reveals 
its greatest value. While it is important to know just where 
a class stands with reference to average ability in a certain 
subject, it is far more important to know the attainment of 
each student in that particular subject. In this way prac- 
tice can be so regulated that it meets the needs of eadi in- 
dividual and does not result in failure to both teachers and 
students. It often happens that the teacher does not form a 
correct judgment of a child's abihty. This is illustrated in 
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thecase of E. N. (Table 34), about whom the teachermakes the 

following report : " E made a high score in this test and 

I think the test was valuable for that reason in that it showed 

me how much E really can do. The children don't look 

pleased in class when it is E 's turn to read because she 

reads in such a monotonous way although I have worked very 

hard with her. One would never give E credit for being 

one of the best oral readers, but she has proved by this test 
that she does know the words and the mechanics of reading." 

Again the tests will determine accurately the best readers 
in the class. Concerning the five students (Table 34), who 
made the best scores the teacher reports : " These are my best 
readers. The test proved this very accurately." 

The report of the teacher also reveals the fact that too 
much care cannot be exercised in seeing that normal con- 
ditions surround the child when the test is taken. If the 
child is interrupted or if it is made to feel that undue im- 
portance is attached to the result, nervousness may greatly 
hinder a true statement of the child's ability. In the case 
of C. O. (Table 34), who made a score of 16.25, ^^e teacher 

reports : " C made a poor score ; ... we all love to hear 

C read and I consider her a good oral reader. I think 

she seemed a Uttle nervous for fear she wouldn't do as well 
as the others and she made so many little mistakes, which 
brought her score down and which she seldom does in school." 
The teacher also makes the same explanation for the low score 

of L. H. She says : ^^ L made a poor score but he is one 

of the best oral readers in my room. He was so anxious to 
excel, and I think that made him nervous, for one would expect 
him to stand at the head instead of at the foot." This shows 
the need to have the tests given under normal conditions. 

Using the Results. — Permanent progress resulting from the 
use of tests will depend upon the use that is made of them. 
Consequently, careful attention should be given to the follow- 
ing work : 
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First, the test should be given at the beginning and at the 
end of the term so that time and energy of pupils and teachers 
are not wasted in finding out what children can and cannot do. 

Second, a graph of each individual score should be kept 
in a convenient place so that each child can see his standing 
in relation to his classmates. The following is a convenient 
graph to use in connection with class results : 



" -H oJB. ; : y.<. rH* 



5.10U2O2530354O456O6SQ0e670S0eO 

Fig, 8. — Showing the position of each student in the 2-A grade (Table 3^) 
according to Us oral reading ability. 

Third, the teacher should keep each child's test sheet in 
order that his difficulties in the mastery of symbols in reading 
may be investigated. 

Fourth, the children in the class should be grouped into 
fast and slow groups according to their ability as revealed 
by the test. 

Fifth, the wide variability in the achievement of children 
in practically every class calls for serious consideration by 
the teacher in the way of readjustment of class groups, 
special promotions, etc. 

If many of these children (Table 34) are good in the other 
subjects of the grade they should be given an opportunity to 
advance to the work of the third grade. The question should 
be asked : Are not some of these children being held back for 
the slower children ? Numerous cases are on record to show 
that when such children are given an opportunity to advance 
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to a higher grade, they are able to maintain the standard of the 
grade without much diificulty and advance with the class, much 
to the surprise of the teacher. 

Haggerty's Visual Vocabulary Test 

Aim. — This test aims to determine the extent to which 
the children have acquired control of words. It is specially 
helpful to the teacher in the early stages of reading. 

Description of the Test. — The Haggerty Visual Vocab- 
ulary Test is arranged in two parts. The second part is 
treated later in the chapter. The first part is a test for 
grades one and two and is divided into series A and B. 
Each series consists of 30 sight words and 25 phonetic words 
arranged in lines. Each Une bears a number. The words in 
series B are different from those in series A but are of the 
same degree of difficulty in order that a class may be tested 
a second time to determine the amount of progress during a 
certain period of time. These tests are printed in the foUow- 
mg form : 

Pupil's Card 
Sight 

50 come 75 mamma 

one next 

who blue 

she wood 

on rabbit 



55 


your 


«5 


lion 




pretty 




monkey 




yes 




cradle 




too 




naughty 




house 




visit 


65 


here 


95 


hurrah 




has 




pigeon 




bird • 




circus 




put 




picnic 




shall 




pieces 



Ph 


'rOnetii 


• 


50 bit 


IS 


yet 


cow 




trick 


that 




toy 


out 




frog 


fox 




find 


55 stay 


85 


crown 


ran 




chew 


gun 




kite 


h'ke 




snag 


meat 




lace 


65 hand 






ten 






name 






head 






cold 
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Giving the Test. — The apphcation of the test is simple 
so that it can easily become a helpful instrument in the hands 
of the teacher. The following instructions are suflBicient to 
insure good results: 

1. Each pupil is to be tested alone. 

2. Hand the child the Pupil's Card. 

3. Ask him to pronounce each word beginning at the top of 
the column. 

4. Do not help the child in any way. Do not correct mistakes 
or suggest ways of working out a word. Do not suggest that the 
child has seen the word before. Do not seem impatient if the 
child makes an error. Allow a reasonable time for each word, 
and if the child does not name the word correctly, ask him to try 
the next word. 

5. Whenever the pupil fails to speak the word correctly place 
on the Class Record Card a zero opposite the word and in the 
column allotted to that pupil. 

6. Record the age of each pupil to the nearest month, that is, 
6 : 5 for six years five months. 

7. Record boys by letter B ; girls by letter G. 

Scoring Results. — When all the pupils of the class have 
been tested, total the number of zeros or errors on each Kne 
for the class and find the percentage this amount is of the 
total number of scores for each line. If fifteen children are 
taking the test there should be 75 scorings (15 X s) on a line 
of five words. If 5 of these scorings are errors, the percentage 
of error would be 5 -7- 75 or 6f .% In the same way each 
child's percentage of errors is determined. 

" The highest numbered line which the child does with one 
(or no) omission or error is taken as his score." 

The score for the class will be the number of the line which 
has 20% of errors. " If no single line gives exactly 20%, 
the actual class score will be intermediate between the two 
lines which give nearest 20% of error." (See form 9, series 
B, Author's Directions for Giving and Scoring Tests in 
Reading.) 
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Interpreting the Results. — The teacher giving the Haggerty 
Visual Vocabulary Test should have no difficulty in interpret- 
ing the results of her test and in applying them to her practice 
in the classroom. The results are usually pronounced, due to 
the fact that the progress of children in the mastery of words 
is rapid if proper methods are employed. The results of the 
work of j&ve teachers in the first and second grades of a city 
school system are reported in Table 36, which is read as follows : 
Thirty-five children in the i— A grade of the Monroe School 
made a score of 37.3 in February and 44.9 in May on sight 
words, which is a growth of 7.6 points ; twenty- two children in 
the 2-B grade of the same school made a score of 50.7 in Feb- 
ruary and 76.2 in May, which is a growth of 25.5 points, etc. 

Table 36. — The Results of the Haggerty Visual Vocabulary 
Test Given in Grades I and II in Two Schools in February 
and Again in May, 1918 

Sight Words 

Monroe Bryant 



Grade 


Enroll- 
ment 


Score 
Feb. 


May 


Dir. 


Grade 


Enroll- 
ment 


Score 
Feb. 


May 


Die. 


I-A . . 

2-B. . 
2-A . . 
2-A . . 


35 

22 

15 
13 


37-3 

50.7 
69.0 

75.8 


44.9 

76.2 
89.7 

73-9 


7.6 

25.5 
20.7 

-1.9 


I-A . 
I-A . 

2-B . 
2-A . 
2-A . 


21 

34 
32 

31 
10 


42.4 

29-3 
56.0 

82.7 

81.3 


52.4 

39.1 
67.6 

92.7 

84.8 


lO.O 
9.8 

II.6 

lO.O 

3.5 



Phonetic Words 



Grads 


Enroll- 
ment 


Score 
Feb. 


May 


Die. 


Grade 


Enrot.t*- 

MENT 


Score 
Feb. 


May 


Dip. 


I-A. . 
2-B . . 

2-A . . 
2-A . . 


35 

22 

15 
13 


29.3 

46.9 
66.1 
68.9 


50.2 

79. 

87.4 
71.7 


20.9 

32.1 

21.3 

2.8 


I-A . 

I-A . 

2-B . 

2-A . 
2-A . 


21 

34 
32 

31 
10 


39.2 
30.2 
46.6 

77. 
70. 


58.5 

38.4 
70.7 

87.9 

77. 


19.3 
8.2 

24.1 

10.9 

7. 
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The column of differences shows a wide variation in the 
amount of growth, which is due to the teaching, the mental 
condition of the class, changes, etc. The principal under 
whose supervision these tests were given reports : " The 
least progress was made in the grades in each school where 
the teachers were changed during the semester. Their scores 
furnish a mathematical statement of the loss caused by such a 
change." 

If the results of these tests as reported in Table 36 are 
represented in graph form, a better idea of the amount of 
growth in each class can be more readily secured. 



Si^ht Words 



Fhonetio Words 



100 
80 
60 

40 
20 

















y 








fy> 








V 






;»'- 























rf" 


^ 






1^ 




• 

















1— B 1-A 2-B 2-A 

May Marks 



1-B 1-A 2-B 

Febrvxirv Marks 



2-A 



Fig. 9. — The amount of growth in the mastery of sight and phonetic words 
in grades I and II of two schools from February to May, 19 18. 

The graphs show that the classes have made approximately 
a half year's progress. The Monroe School occupies a slightly 
better position in May than it did in February. 

Using the Results. — The use which can be made of these 
tests and the extent to which they can affect classroom prac- 
tice is best told in the words of one of the teachers who gave 
the test. " The fact that so many children failed to recognize 
such words as name, head, think, here, yes, shows that more 
drill should be given to words of that type." " An interesting 
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feature of the test was the manner in which different pupils 
gave the words. Those who were sure of the words gave them 
rapidly. Many knew the phonograms, but failed to enun- 
ciate them properly. Some who read well in their books 
failed to recognize the words because they were isolated." 

"After studying the results I decided to adopt a more ejffident 
method of teaching phonics. 

" I. I reviewed carefully the basic facts of phonetics and made 
plans for a more complete course of phonetic instruction. 

" 2. I determined to give more attention to individual instruc- 
tion in the mechanics of reading. The number of concert recita- 
tions was limited, and the classes were grouped according to 
weaknesses along important lines. I found that some children 
needed more drill on the basic phonograms, and others on proper 
enunciation. 

"3. At the suggestion of the primary supervisor a system of 
checking up daily the individual errors was tried out by means 
of the following chart : 

Phonic Chart 

J i I « 8 s 



U 3 



% 



I 



& 



i 



Phonosrazns 

AY 

AKE 

AliE 

ADE 

AIN 

AYE 



JOO 




100 


100 


100 




100 




100 


100 


100 


























V 






























n/ 


























V 






















V 































" As each child pronounced the phonograms on the left side of 
the chart his record was marked in the column with his name at 
the top. If no errors were made, one hundred was placed under 
his name ; if errors were made a check was placed opposite the 
phonogram. 
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" In conclusion, the tests emphasize the importance of indi- 
vidual help in teaching phonics. It has brought to me very 
strongly the fact that individual help in all other subjects as 
well is equally important." 

Corrective Measures. — Many reading recitations would 
be more profitable if they were preceded by a study period 
in which the children followed a definite assignment. Too 
often a class is hastily told to read seven pages or the next 
story and be able to tell what has been read. This is wholly 
inadequate, leading to a hurried and aimless sketching in 
which the child unconsciously omits or neglects the very 
things he should master and spends his time on the easy and 
entertaining parts because he has not been taught to analyze 
the difficulties. 

A definite assignment is necessary to insure a profitable 
study period. The nature of the subject matter and the 
status of the class determine the kind of assignment that will 
be most effective. 

The following methods have been found helpful : 

1. A word and phrase drill should precede the study 
period with emphasis on the time factor ^ — quick recognition 
is the important thing. Pupils may practice during the study 
period to reduce the time it requires them to recognize these 
lists. Group work may be done. 

2. Questions and directions on the board will lead to a 
study of thoughtful interpretations. For example, "In 
which speech is John angry?" "Is he slightly vexed or 
very angry? " " Show by your reading which you think he 
is." 

3. A short enunciation drill may be assigned for study be- 
fore class. 

4. Individual assignments to correct individual difficulties 
may be given. 
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Other Tests 

The Kansas Silent Reading Tests devised by Dr. F. J. 
Kelley ' are made up of a graded list of paragraphs or exercises. 
These tests preceded and are not unlike the Monroe 
Standardized Silent Reading Tests, The Monroe Tests have 
incorporated those features of the Kansas Silent Reading Tests 
which have proved satisfactory. There are three tests: 
No. I, for grades 3, 4, and 5; No. 2, for grades 6, 7, and 8; 
No. 3, for grades 9, 10, 11, and 12. Each test is divided into 
two forms, which are different in subject matter, but equal in 
difficulty, so that the same grades can be measured at different 
times. The tests are intended to measure speed and compre- 
hension in silent reading. The scores on these two points are 
given in one mark. The score in speed and comprehension 
for each exercise is recorded on the left-hand margin of the 
test sheet. The child's abihty in silent reading is, therefore, 
the sum of the values of the exercises which he reads correctly 
in five minutes. 

These tests have been widely used so that standards are 
available. These standards are given in terms of the median 
and percentile scores as indicated below : 



(^«n 


III 


IV 


V 


VI 


vn 


vm 


IX 


X 


XI 


xn 


Twenty-five percentile 
Median score . . . 
Seven Ly-five percentile 


S.2 


9-5 


9-4 
■-3-0 

i?-5 


9-4 
13.0 
iq.S 


11.8 
16.2 

2tlQ 


'3.7 
26.4 


16.0 
22.Q 
^04 


t7-9 
25.6 


18.7 

26.5 

33-1 


3S-3 

ag.7 
34-1 



The Fordyce Scale for Measuring the Achievements in 
Reading, devbed by Dr. Charles Fordyce,' is intended to 
measure the speed and quality (comprehension) of silent 
reading. The scale is divided into test No. i, designed for 
grades 3, 4, and 5, and test No. 2, desigaed for grades 6, 7, 
• See Bibliography. 
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8, and 9. The legend of " Narcissus " is the selection for test 
I and " The Spirit of Spring " the selection for test II. The 
rate of reading is determined by the number of words read 
at the expiration of three minutes for Part I and five minutes 
for Part II. The extent of the pupils' ability to understand 
is determined by answering certain questions on the entire 
selection for which ten minutes and fifteen minutes are given 
respectively for Tests I and II. In order that all the children 
may have the same information, they are given an opportunity 
to finish the part which they have not yet read. Below are 
given the standards in percentages for the test : 

Test No. I, designed for Grades III, IV, and V. 



Grade 


m 


IV 


V 


Speed 

Quality 


90 

57 


95 

71 


100 
74 



Test No. II, designed for Grades VI, VII, and VIII 



■ 

Grade 


VI 


VII 


vra 


Speed 

Quality . . . . . 


90 

73 


100 

75 


100 
76 



This test is arranged in convenient form so that the teacher 
can give it to her class quickly and efiiciently. 

It gives the reading ability of children in terms of speed and 
quality (comprehension). The score sheet is arranged so 
that the ability of each child can easily be seen. The scale 
becomes, therefore, an important instrument in the hands 
of the teacher to diagnose the needs of her class. It is receiving 
wide use throughout certain sections of the coimtry. 
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Brown's Silent Reading Tests, constructed by H. A. 
Brown,^ are made up of easy reading selections to be used in 
grades 3 to 8. They are intended to measure speed and the 
quantity and quality of comprehension. After the children 
have read as much as they can in exactly one minute they 
are asked to write as much as they can remember of what 
they have read. A key is used to determine the quantity 
and the quality of what has been reproduced. In addition 
to the scores in speed and comprehension, the child's ability 
is given in terms of one mark called " reading efficiency," 
which is obtained by multiplying the scores in speed and 
comprehension. The following are tentative scores so far 
available for the test: 





Words per Second 


Comprehension 


Reading Efficiency 


Grade in 


3.32 


46 


127.8 


Grade IV 


3-55 


65 


217. 1 


Grade V 


4.40 


61 


291.0 


Grade VI 


4.54 


68 


295.0 


Grade VII . . . . 


4.6s 


78 


322.3 


Grade VIII . . . . 


4.84 


79 


323-6 



The tests are useful in determining the reading abiUty of 
individual children on account of the fact that the different 
factors — speed, quantity, and quahty of comprehension — 
which are necessary for good reading can be determined. 

The Eaggerty Visual Vocabulary Tests by Dr. M. E. 
Haggerty are a " slight modification " of the Thomdike 
Visual Vocabulary Tests with the addition of an oral test 
(Part I) for grades I and II which is discussed under Oral 
Reading. 

Part II is a hst of words called " Scale R " for grades III 
and IV and another list of words called " Scale R 2 '' for grades 
V, VI, VII, and VIII. 

1 See Bibliography. 
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The Thomdike Reading Scale, Word Knowledge or Visual 
Vocabtdary, by Dr. E. L. Thomdike is intended to deter- 
mine the extent of a child's knowledge of words. 

The scale is divided into two divisions, Scale A-2 and Scale 
B. Each scale has two series, X and Y. Each series is made 
up of a graded list of words which increases from simple words 
familiar to almost any child with two or three years in school 
to less familiar words which school children seldom meet. 
The list of words for each series and for each scale is of 
equal degree of difficulty for the purpose of testing the same 
children more than once to determine the extent of progress. 
These tests are intended for grades III to VIII. 

Gray's Silent Reading Test by Dr. Wm. S. Gray ^ is made 
up of the following selections for grades as indicated : 



Grades II and III 
Grades IV, V and VI 
Grades VII and VIII 



Tiny Tad 

The Grasshoppers 

Ancient Ships 



The test is used to determine the rate and quality of 
pupils' reading. Quality is " based on the ability of pupils 
to reproduce what was read and to answer specific questions 
concerning the subject matter of the test." Only one pupil 
is tested at a time. This test has recently received consider- 
able use in the St. Louis survey. On account of the large 
amount of time required to give this test, its use is limited in 
the hands of classroom teachers. The standard scores for 
this test are as follows: 



Grade 


II 


III 


IV 


V 


VI 


vn 


VIII 


Rate (words per second) 
Quality 


1.50 
32 


2.30 
37 


2.20 

29 


2.57 
32 


2.79 
39 


2.69 
22 


2.87 
27 



See Bibliography. 
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Ackievement Examination in Reading: Sigma i by Dr. 
M. E. Haggerty and Margaret E. Noonan is intended to 
measure the reading ability of children in grades I to III 
inclusive. It was devised in 19 19 in connection with the 
Virginia school survey. In addition to its use in the state 
of Virginia, it has been used in St. Louis, Cincinnati, Madison, 
Bloomington, and Aberdeen. 

This reading test is in two divisions in one pamphlet. 
Test I and Test 2. Test 2, which consists of a series of ques- 
tions to be answered by no or yes, as " Can you eat ? No — 
yes; Can a bat walk? No — Yes," etc., should be given 
first. Test i consists of a series of performances, as " Put a 
tail on this pig." (Picture of pig without tail given.) A 
Manual of Directions by Dr. M. E. Haggerty contains ex- 
plicit instructions for this test and also for Intelligence 
Examinations: Delta i and Delta 2. This manual should 
be in the hands of every teacher giving the test. 

The results from this reading test may be secured in terms 
of scores for grades and age groups. The following grade 
standards and age norms are available: 

Table 37. — Grade Standards for Achievement Test in Reading. 

Sigma i 



Grade 


I 


II 


III 


IV 


e J Test I 

Score rr. ^ 

I Test 2 


4 
2 


12 
8 


16 
14 


20 

18 



Table 38. — Age 


Norms for Achievement Test in 
Sigma i 


Reading. 


Age in Years 


7 


8 


9 


10 


II 


e f Test I . . 
Score rj. . 

I Test 2 . . 


• 
• 


6 

4 


12 

7 


IS 
12 


18 
IS 


24 
19 
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The authors of these tests have overcome the difficulties 
encountered in determining the reading ability of young 
children who are unable to follow complicated instructions or 
who have not acquired a wide vocabulary. The nature of 
the subject matter appeals to the interest of the children. 
The simplicity of the tests and the ease with which they can 
be given make them a valuable instnmient in the hands of 
every primary teacher who wishes to know the achievement 
of her children in reading. 

The subject of reading has been receiving marked attention 
from psychologists and other educational experts during 
recent years. The importance of reading in every child's 
development justifies this attention. This interest has resulted 
in sufficient scientific information in the form of standardized 
tests, scales, and standards of achievement which enable every 
classroom teacher to determine accurately the ability of her 
children in reading. Every teacher now in a classroom or 
coming fresh from the training school should be sufficiently 
familiar with a test or several tests so that she can justify 
her instruction in reading by scientific facts as well as by 
opinion. 
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CHAPTER VI 

THE MEASUREMENT OF ENGLISH COMPOSITION 

The importance of written composition in the life of the 
pupil and the indefinite notions of teachers as to proper 
standards in written composition make the need for an 
objective measurement an exceedingly urgent one. The 
difficulty of securing such a measurement in written com- 
position over the measurement of such subjects as spelling, 
arithmetic, penmanship, is greatly increased by the large 
number of factors which make up written composition. 
There are the different kinds of written composition, as 
narration, description, exposition, and argimientation ; there 
are also the factors of the content and such form elements 
as punctuation, spelling, capitalization, and sentence structure. 
All of these difSculties have hindered the construction of a 
scale for written composition which can be used with ease and 
accuracy by the classroom teacher in determining the ability 
of her children in this subject. 

The first scale for written composition was constructed by 
Dr. Milo B. Hillegas.^ It measures general merit in written 
composition. It consists of lo samples, 3 of which are arti- 
ficial, 5 were written by high school students, and 2 by college 
freshmen. The values of the different samples range from 
o to 9.3 with wide steps between samples. This scale has 
shown the need for a written composition scale and formed 
the basis for the construction of other scales which have a 
more direct application to particular grades of work or to local 

^ See Bibliography. 
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situations. Examples of this procedure are found in the 
Nassau County Supplement to the Hillegas Scale for Measur- 
ing the QuaUty of English Composition, proposed by Dr. M. 
R. Trabue ^ and the Extension of the Hillegas Scale for Measur- 
ing the Quality of English Composition of Young People by 
Dr. E. L. Thorndike.i 

The Nassau County Supplement to the Hillegas 

Scale 

Aim. — The purpose of this scale is to determine the general 
merit of children's written composition. No attempt has 
been made by the author to define the different elements in 
general merit. Hillegas in referring to his own scale which 
measures the same thing says, " The term (merit) means just 
that quality which competent persons commonly consider as 
merit, and the scale measures just this quality." 

Description of Tests. — For the purpose of giving the teacher 
a clearer understanding of the scale and its application, the 
entire scale is given below : 

What I Should Like to Do Next Saturday 



I went going on to the Dox Saturdaye dnd day we the 
boys and I well going home and I well going the boys, and I 
will going these read in and they to night, and we or night. 
I well going a ground shalt and I gone out I will going to shea 
shouse and I will shoe or the skill of the shea of night. 

I.I 

I intend to mak a snou man and make an fort and fort 
snou ball at chidem and hau I whist ma frant carolyn cole 
what were me I will going to the mauiss on Saturday. 
Georga will come went me. 

^ See Bibliography. 
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at night I will going out went my mother to the marce. 
I will mak the snou man and the fort in the moning and in 
the af termoon I will go to the mauies. I whist there whest 
school on Saturday. 

1.9 

one next Saturday. I expect to go to the city leve next 
Gaturday to see my ofriend archie king I am going to grow 
to the baning balys circus with hime next Saturday fefore 
I go I have to do my jobs feedsing the cows and horse ard 
chinkens and geese next Saturday. My friend is a very 
good fellow to go and see So my mother Said *'If I do my 
work during Easter week vacation I can go to the barning 
baley circus with, hime 

2.8 

Once a pon a time there was a girl. One day she asked 
me what I was going to do next Saturday so I said, "I am 
going to go for a swim." And she said, "thats 

just where I am going to." next Saterday came we both 
went down together. We came home at noon time, after 
dinner we went to the picktures. There we had a good 
time. And then came home at night. 

3.8 

I would like to go out in the afternoon and play catching 
the ball. Go over to Bertha^s house and have a few girls 
to come with me and be on each others side. I have a tennis 
ball too play with. The game is that one person should stand 
quite aways from another person and throw the ball too one 
then another. Someone has to be in the middle and try 
too get the ball a way from someone then she takes this 
persons place who she caught the ball from. Then till every 
person has a chance. 

5.0 

Next Saturday I should like to go away and have a good 
time on a farm. I should like to watch the men plowing 
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the fields and planting com, wheat, and oats anAother things 
planted on farms. 

Next Saturday I will go to the Pioneer meeting if nothing 
happens so that I cannot go. I should like to go swinmiing 
but it is not warm enough and I would catch a bad cold. 
I should like to go to my aunts and drive the horses, I do 
not drive without some older person with me, so I cannot 
go very often. 

I should like to see my aunts cat and her kittens, too. I 
think I can, to. 

6.0 

I should like to join my girl friends, who are going to the 
city on the 9-05 a.m. train. They are going shopping in 
the morning and will have lunch to-gether, then they are 
going to the Hippodrome. After the Hippodrome, they 
are all going home to dinner to one of the girls houses, she 
lives on Riverside Drive so they expect to take the "Fifth 
Avenue Bus" up there. The evening will be devoted to 
playing games, singing and dancing. 



7.2 

If I had a thousand dollars to spend, I think I would take 
a trip to San Francisco by train with the rest of the family, 
and stop at a sea-side hotel. It would be glorious to see the 
surf again, and to escape from the cold blustering weather of 
December for the balmy breezes of the ocean, and the whiflF 
of orange blossoms. 

We could take long drives under shady trees, visit the 
orange and olive groves and bathe in the surf. Think of 
bathing in the ocean in December. 

Coming home again I would enjoy stopping at Yellow 
Stone Park. It would be lots of fun to camp out, and to 
ride over the prairies on frisky ponies. It would be very 
interesting to notice the change of climate as we got farther 
east, and to go to bed on the train one evening feeling warm, 
and waking up the next morning feeling very chilly. 
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I am afraid by the time I would get home a thousand 
dollars would be pretty well used up; but if not I would 
like to give a party. 

8.0 

One Sunday, towards the end of my sunmier vacation, 
I was in bathing at the Parkway Baths. In the Brighton 
Beach Motor drome, a few rods away, an aviation meet was 
going on. Several times one of the droning machines had 
gone whirring by over our heads, so that when the buzzing 
exhaust of a flier was heard it did not cause very much com- 
ment. Soon, however, the white planes of "Tom" Sop- 
with's Wright machine were seen glimmering above the 
grandstand. Everyone stood spellbound as he circled the 
tract several times and then headed out to sea. He was 
seen to have a passenger with him. Suddenly, the regular 
hum of his motor was broken by severe pops, and the engine 
ran slower, missing fire badly. In response, to Sopwith's 
movements, the big flier tilted and swooped down to the 
beach from aloft like an eagle. The terrified crowd make 
a rush to get out of the way as the airship came on, but 
Sopwith could not land on the beach, but skimmed along 
dose to the water instead. Suddenly his wing caught the 
water, and the big machine somersaulted and sank beneath 
the waves. The aviators soon came bobbing up and were 
taken away in a launch, but the accident wiU not soon be 
forgotten by those who saw it. 

9.0 

The courage of the panting fugitive was not gone; she 
was game to the tip of her high-bred ears ; but the fearful 
pace at which she had just been going told on her. Her legs 
trembled, and her heart beat like a trip-hammer. She 
slowed her speed perforce, but still fled industriously up the 
right bank of the stream. When she had gone a couple of 
miles and the dogs were evidently gaining again, she crossed 
the broad, deep brook, climbed the steep left bank, and fled 
on in the direction of the Mt. Marcy trail. The fording of 
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the river threw the hounds oflF for a time; she knew by 
their uncertain yelping, up and down the opposite bank, 
that she had a little respite ; she used it, however, to push on 
until the baying was faint in her ears, and then she dropped 
exhausted upon the ground. 



The first seven specimens were selected from compositions 
written by children in the elementary schools of Nassau 
County, New York, on the topic, " What I Should Like to Do 
Next Saturday." The last three were selected from a list of 
compositions published by Dr. E. L. Thorndike. The values 
range from o to 9. It is intended for grades 4 to 12 inclusive. 
The compositions are arranged on one sheet with the value of 
each composition printed on the left-hand margin. For the 
sake of clearness in this text the value of each paragraph is 
placed in the middle of the page and above the paragraph. 

Applying the Scale. — After a set of composition papers has 
been secured which has been a part of the regular class work 
or a special assignment on some topic which is familiar to the 
pupils as an Exciting Experience, An Accident, What I shall 
Do Next Saturday, etc., each paper is compared with the 
scale with a view of deciding on the specimen which its 
quality most closely resembles. The scale value of this 
composition is marked on the child's composition. If a finer 
rating is desired than the values as given to each composition, 
units between each value may be used. For example if a 
composition is better than quality 3.8 and not as good as 
quality 5.0 a rating between these two qualities may be given 
as 4 or 4.5 according to the judgment of the teacher. 

Scoring the Results. — As soon as a rating has been given 
to each child's composition the papers are grouped according 
to their values, from the lowest to the highest. The number 
of papers in each group is then determined and the results 
recorded on a record sheet similar to the following : 
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Table 39. — Rbclts ix a School Gkades IV to VI 

Class Recoed Sheet 

City D. School L. Grades 4-B— 6-A 



Teacher or Principal 



Date Jan. 7, 1 



N™-.orS™«» 


GUDBS 












j^- 




n- 


V J M 1 ^^I \-ni 


IX 


X 


XI 


X 




B 


A 


B 


A B 


A 


">:."* 


B A 


B 


A 


B 


A 


B 


A 


0- 

1.9 
3.8 

3-8 

1.1 
8.0 

Q.O 

Tola 
ofp 


^ 

ipe 


0. 


7 
6 

1 

2 

53 


5 

7 

2 

21 


S 
•9 
It 

3 

42 


.6 
M 

43 


It, 
lE 

3 

38 


I 
9 
9 
14 

33 
















Median 
scores . - 


..I 


... 


2.8 


3.8 


50 


S-o 















After the rates for all the compositions are recorded on the 
class record sheet, the median class score is determined. 

The above table is read as follows : In 4-B grade there 
are 23 pupils of which 7 received a rating of o, 6 received a 
rating i.i, etc. The median score for the 4-B grade is 
I.I. 

Interpreting and Using the Results. — The tentative 
standards for the Nassau County Supplement as arranged by 
the author are as follows : 
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Gt.-.m 


T^MTAX. 


VE Median Si*in>ABD 


Fourth 




3.5 
4.0 


Fifth . . 






Sbth . . 






4.5 


Seventh . 






5-0 


Eighth . 






S-S 


Ninth . , 






6.0 


Tenth . , 






6.5 


Eleventh . 






6.9 


Twelfth . 






7.2 



The following table taken from the Nassau County Survey 
shows the scores which have been attained in the cities named : 







Table 41 
















MiDIAH SCOM ArTAINH) IN GSADE 


Sninra. Rvtmii. 






rv 


V 


VI 


Vlt 


vm 


IX 

^.00 


X 

S.2S 


XI 

v68 


Xll 


Nassau County .... 


a.76 


,1-42 


^82 


4..8 


4.S6 


S-q4 


Lead, S. D 


Vil 


4.I1 


4.04 


S.OI 


5-57 










Newark, N.J. (one school) . 


2-39 


?-5i 


3-5(>\4-33 


5-«7 
5.74 










Ethical Culture School, N.V. 




Chatham, N.J 


2.q,' 


2.KS 


4.10 


4.02 


.s.zo 










Salt Lake City . . 




'!.■;« 


V»4 


4.61 


^.16 


6.,^7 










Butte, Mont. . . 




ZvM 


2.80 


S.41 


3-n 


4.11 










South River, N. J. 




Z.^I 


2.,-?,1 


V7S 


47'; 


S.62 


ua 


^.Q2 


■;-<)=; 


6.,lo 


MobUe County, Ala. 










4.^2 




S.=i6 


6.^ 


ios 


i.77 


MobUcAla. . . 




,V3i 


^s^ 


4.60 


4-qS 




S.6q 


6.0.1 


7.24 


7.i4 


54 high schools . . 














400 


5.88 


6.38 


b.bg 



The following table shows the median scores for rating 
compositions of 539 pupils in grades 4 to 6 inclusive for four 
schools in a city school system in January, 1920. 
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Table 42 



School 


IV-B 


IV-A 


V-B 


V-A 


VI-B 


VI-A 


X • • • • 






2.8 


3.8 


3.8 


5.0 


^ • • • • 






2.8 


2.8 


2.8 


3.8 


3 • • • • 


I.I 


I.I 


2.8 


3-^ 


5.0 


S.o 


4 • • • • 


2.8 


2.8 


3.8 


3.8 


3.8 


3.8 



If a comparison is made of the scores reported on the class 
record sheet (Table 39) with the scores in Table 41 it will be 
seen that the written composition work in this school in the 
fourth grade is below all the scores, in the 5-B grade it is 
below all scores except two, in the s-A grade it is exceeded 
by 5 out of 10 scores, and in the 6-B and 6-A grades it 
exceeds all scores. 

It is evident, therefore, that the written composition in 
the fourth grade is exceedingly low, in the fifth grade it is 
also low, but in the sixth grade it is ahead of the work in the 
places with which comparisons are possible. 

The unusually low scores in the fourth grade may be ex- 
plained partially by the large number of foreign children in 
this school. These scores are surely an indication that more 
systematic training in oral and written composition should be 
given in the fourth grade and also in the third grade. 

The scores in Table 42 were obtained from compositions 
which were written as a class exercise under the supervision of 
the classroom teacher. All of the children wrote on the same 
theme, " An Exciting Experience." Twenty-five minutes 
were allowed in which to do the work. The exercise was 
given one week before the end of the first semester. The 
purpose of the exercise was two-fold : first, to determine the 
attitude of the teacher towards the use of such a scale in 
rating composition papers as opposed to the regular method ; 
and second, to ascertain whether or not the scale in con- 
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nection with such an exercise could be used to determine 
promotions. 

The teachers who gave the exercise and scored the papers 
were unfamiUar with the use of the scale. Carefully prepared 
instructions were given to each teacher. 

The results show that in general the composition work in 
these four schools in comparison with the standards is low. 
Judging from the comments of the teachers who gave the test 
and who were asked to give their opinion of its value, the 
constant use of such a scale would improve the composition 
work in these schools. 

The opinion of all the teachers was that " the scale is a 
quicker and more accurate method of grading themes." An 
analysis of the teachers' reports shows very clearly the lack 
of standards as to just what constitutes good written com- 
position work. The prevailing opinion was expressed in the 
words of one of the teachers : " If a standard for composition 
is established for each grade, it will be a helpful guide in 
carrying on the written work." 

The use of such an exercise as a basis for determining 
promotions received general approval. The different judg- 
ments can be summarized in the words of one of the teachers 
who reported as follows : " It seems a fairer and more accurate 
way of judging a child's ability than the customary examina- 
tion." 

Once teachers realize the necessity for more definite 
standards in such subjects as written composition and under- 
stand the use of such a scale as the Nassau County Supple- 
ment and the help derived therefrom, a more scientific 
procedure is assured. The principal of one of the schools in 
which the above results were secured reports as follows: 
" The results of our first use of the Nassau County Supplement 
to the Hillegas Composition Scale seem entirely favorable. 

" A few teachers were at first of the opinion that consider- 
able extra work was involved in the use of the scale. Lack of 
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familiarity with it caused much more time to be taken with 
this work than would be necessary after further use. I think 
that the opinion was changed after the last papers were 
graded.'' 

The Nassau County Supplement to the Hillegas Scale is, 
on account of its simplicity, one of the best written com- 
position scales so far available for use by teachers in the 
elementary schools. It enables them to grade their written 
compositions more accurately and with greater speed than 
the present system of marking. 

Willing Scale for Measuring Written Composition 

Aim. — The purpose of this scale is to measure the " Story 
Value " and also the " Form Value " of the written com- 
position of children in grades four to eight. 

Description and Application of the Scale. — The scale 
consists of 8 compositions ranging in value from 20 to 90 in 
steps ten units apart. " They are all on the same topic, 
An Exciting Experience. The scale is made up of the material 
it attempts to measure." By " Story Value " is meant the 
degree of completeness with which the story in the composition 
is told ; by " Form Value " is meant the " number of mis- 
takes in speUing, punctuation, and syntax per hundred words." 

Full instructions are provided for the use of the scale. 
According to the plan, a class exercise is given on some topic 
as An Exciting Experience, A Storm, An Accident, etc. 
Twenty-five minutes are allowed to write the exercise. These 
compositions are then used as a basis for determining the 
children's ability in written composition by comparing each 
composition with the scale and giving it the value of the com- 
position on the scale which it most resembles. 

Scoring the Results. — The compositions are scored first 
for the value of the story and second for errors in grammar, 
spelling, punctuation, and capitalization. If a composition 
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scores high for the value of the story and low for its form value 
or vice versa, a score between these two values should be 
finally agreed upon. According to the author " no paper is 
marked above 70 which does not have good story value and 
technical excellence, nor is a paper marked below 40 which 
does not lack both of these qualities." 

Below is given a class record sheet showing the scores from 
a 6-B grade in a city school system in January, 1920. 

Table 43. — Class Record Sheet por Willtng's Scale for Meas- 
uring Written Coufosition 
City D School L. P. Grade 6-B 

Teacher A. H. Date Jan. 7, 1920. 



Wo^ 


„ 


„ 


40 


SO 


w 


TO 


so 


M 


Euons 


10 2.g 

3 to S'Q 

6 to 8.1) 

g lo 11-9 .... 

12 to 14.9 .... 

i5toi;.9 .... 
r8 to 10.9 .... 
ai to 23.Q ... - 
24 to 26.9 .... 
87 to 2g.g .... 
Above 30 . . . . 
Distribution for 
"story value" . . 






■ 


6 


5 


3 
S 


i 

1. 


4 
6 


g 
s 

5 



Class medians. 



Form valui; 3.3 



Slory value 70 



This record sheet gives the distribution of the scores of 
this class according to the story value and the form value of 
the different compositions. It is read as follows : The papers 
of nine pupils showed from o to 2.9 errors in " spelling, punc- 
tuation, and syntax per hundred words." Of this number one 
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pupil had a story value of 60, three pupils had a story value of 
70, etc. The median class scores are form value 3.5 and story 
value 70. 

Interpreting and Using the Results. — The following 
standard scores are available with which comparisons can 
be made : 

Table 44. — Median Scores for Wilung's Composition Scale 



Grade 



IV . 

V . 

VI . 
VII 
VIII 



Denver 



Story Value 



32 
43 
50 
60 

63 



Form Value 



22 
16 

14 
II 

10 



Five Kansas Cities 



Story Value 



44 
58 

75 
77 
82 



Form Value 



12 
10 

5 

5 
6 



By comparing the scores on the above record sheet (Table 
43) with the standards (Table 44), it is readily seen that there 
are two children who are very poor in both story value and 
form value. Possibly the teacher was acquainted with this 
fact, but just how much she could not tell unless such a test 
had been given and standards of attainment for different grades 
provided. She knows now that one pupil does not exceed the 
fourth grade standards of the five Kansas cities and the other 
is below the fourth grade standard in form value and the fifth 
grade standard in story value in the same cities. She also knows 
that her class as a whole is above the Denver standards for 
the eighth grade and below the sixth grade standard in story 
value and above the eighth grade standard in form value in 
the five Kansas cities. 

An analysis of the errors on the papers from this class will 
enable the teacher to know the forms which should continue 
to receive drill. She will also know that steps should b« 
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taken to improve the children in grasping and expressing the 
story in such a written theme. 

The plan of judging the compositions first for the value of 
the story and second for errors in form enables the teacher to 
analyze the strength or weakness of her written composition 
work. This being accomplished, she is able better to stress 
those elements in which the class is weak and which are 
essential to good composition. 

In the scoring of compositions as to general merit, when 
errors are taken into consideration, it is possibly better to 
count only certain types of mistakes. The plan of counting 
mistakes in the Gary survey is suggestive. The following 
types of mistakes were observed : 

IN CAPITALIZATION 

1. " Failure to begin a sentence with a capital letter. 

2. Failure to capitalize a proper noun. 

3. The capitalization of common nouns. 

IN PUNCTUATION 

1. Failure to place a period at the end of a sentence. 

2. Failure to place a question mark at the end of a question. 

3. Failure to inclose a direct quotation in quotation marks. 

IN SYNTAX 

1. The use of the wrong case form, as ' me and him went.' 

2. The use of one word in place of another, as ' it would of 
(have) been.' 

3. Lack of agreement of noun and pronoun, as ' the pieces 
were about the size — and it break.' 

4. Lack of agreement between subject and verb as ' they 
was.' 

5. Use of the wrong tense form, as ' seen ' for ' saw.' 

6. Use of the double negative. 
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7. Confusion of dependent and independent clauses, as 
* . . . away, but worst thing was that there were not light 
on the streets and no road but a Uttle path through the wood 
but I dressed up and took my dog and started off we were not 
far from home when my dog his name was Rover began to 
chase after I was a fright to go myself and began. . . / 

8. Omission of words essential to the thought, or the 
addition of irrelevant words, as ' began to chase after (a 
rabbit, omitted) I was a fright to go myself.' " 

In addition to the diagnosis of the language abilities of her 
class and a comparison of the results with standards of 
attainment in other cities, the teacher should also be able to 
compare her class with other classes in the same school or in 
other schools of the same city ; the supervisor should likewise 
be able to know the weak and strong spots in her organization. 

The following table shows the median class scores for 
written composition in grades 5 and 6 in three city schools 
in January, 1920, according to the Willing Composition Scale. 

Table 45 



Story Value 


FoRic Value 


School 


S-B 


S-A 


6-B 


6-A 


S-B 


5-A 


6-B 


6-A 


I . . . 


• • • 


70 


81 


80 


80 


14.2 


8 


7.3 


9.1 


2 . . . 


• • • 


43 


50 


65 


65 


II. I 


10.9 


10.9 


4.0 


3 . . . 


• • • 


80 




70 




8.5 




3.5 





This table is read as* follows : School number one made in 
story value 70 in 5— B, 81 in 5— A, etc., and in form value, 14.2 
in 5— B, 8 in 5— A, etc. The scores for the 6-B grade in school 
number three are also shown on the class record sheet on 
Table 43. 

A comparison of these results with the standards so far 



The Measurement of English Composition 171 

available shows that in story value school number one stands 
well; in form value it is above the Denver and below the 
Kansas Standards. An outstanding fact in this school is 
that the 6^A grade shows more form errors than the 5— A or 
6^B grades, a clear indication of the need for more drill on 
form. School number three is above school number one and 
the other standards in both story and form values except 
in story value for the 6-B grade. School number two is 
below the other two schools and the Kansas cities in story 
and form values; it compares fairly well with the Denver 
standards in story and form values. 

Although the differences in scores may be due partly to 
the scoring of the teachers who were limited in their expe- 
rience in using the scale, it is evident that the written com- 
position work in school number two is weak. More system- 
atic instruction should be given in training these children 
to organize their experiences in outline form and to express 
their ideas orally before undertaking the written work. If 
this preparatory instruction is given before the children are 
permitted to write, the written compositions will show marked 
improvement in the content. Drill exercises on the mistakes 
in punctuation, spelling and syntax should follow the written 
themes. 

The teachers who gave the composition exercise and scored 
the papers to secure the results in the above table were pro- 
vided with all the instructions which accompany the Willing 
Composition Scale. All of the children wrote on the same 
theme, " An Exciting Experience.'' They were allowed 
twenty-five minutes to do the work, which was done as a 
class exercise. 

All of the teachers reported that " it was easily given." 
Many of them stated that the papers were difficult to score 
with the scale on account of the lack of variety in com- 
positions comprising the scale. They agreed that the scale 
in connection with such an exercise was a fairer method of 
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determining promotion on written composition than the cus- 
tomary examination. 

The two outstanding facts as reported by these teachers 
are the following : first, the scale applied to such an exercise 
is a good test of the mechanics and the story value of written 
composition ; and second, the value given to each composition 
on the scale for spelling, syntax, and punctuation is exceedingly 
helpful in directing attention to definite strength and weak- 
ness. 

The Willing Scale for Measuring Written Composition can 
be used by the teacher with advantage to determine the 
ability of her children in written composition. By analyzing 
the written composition work into its " rhetorical and formal 
qualities," instruction can be more carefully directed so 
that more satisfactory progress is assured. 

The Teacher's Composition Scale 

Probably the greatest good which a group of dty teachers 
can derive from the use of a written composition scale will 
come from a scale of their own construction out of the material 
which they intend to measure. For such a purpose, it is rec- 
ommended that the Nassau County Supplement be used 
as a basis, although the teachers may find it helpful to make 
comparison with other scales. 

A simple plan for doing this was followed in the con- 
struction of the Gary Composition Scale. Compositions 
were selected and arranged into a scale by matching them with 
the specimens in the Hillegas scale. The values of the different 
compositions in the Hillegas Scale were then given to the 
corresponding compositions selected from the material to be 
measured. 

For the purpose of indicating to teachers the nature of 
such a scale, the complete Gary Composition Scale is 
given: 
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Sample A. Value 5 

An Exciting Experience 

I was on a wonderful to now-Yeark and it waos and the journey 
that I lecekt so mouch butit the fun I head on the trean but the 
buge-ried in calmer Nou-Yeaork waos the beast thean. and I 
belavef is all that I can thank on so godboy. 

Sample B. Value 15 

The Beaer and a Boy 

Ones their was a littil boy. He went in the weth a gan and a 
hachet and he saw a beaer. The beaer begain to run after the boy. 
And their was a oake tree he stared to clamb the tree. The beaer 
avas after the boy and the beaer stared to clamb the tree. And 
the beaer clambed and clambed so the boy thart he beter get out 
of their he fawed a hoi in the so the got down in the hoi and the 
beaer con't find the boy. So the beaer went on an lime. The 
bay clambed out and chot the lime of and the beaer wos celed. 

Sample C. Value 20 

Once My nother want to voiter a frincd And she take me a 
gave me to my aunt thats ny mother want away. And my aimt 
was choping wood Andl was playing biut a bucter Came and ny 
aent want inside The house and I want and take The hatck and 
was choping the wood with the hatchek and chop my Finger and 
my nother wasnot Home so I uat crying and Than ny mother 
came after There day and I was over ny aunt House I was in 
bed and ny mother was glad I was Well and I was happy again. 
And than I have a mark on My f rienger it is the left hand Frienger 
it is the 2ndfrineger and the Than after my mother take me Home 
and I was a happy girl After But I was till or ny frienger That 
marke stayed ny mother Was helfing my friend and Now I am 
happy girl as Happy as can be happy. 
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Sample D. Value 30 
In The Mountains 

As we went to spent our vacation I happen to be right near the 
mountains I was glad couse I could go and climb just as higch as 
I want for. 

So I went with my father and mother we went pvery hiegh it 
was getting cold already why I think abouve the clouds I want to 
rich the tops but couldn couse there was ice and it was so sleapry 
to goe any further se we came baak when we came down there was 
many more moutains and I disided to go on some others well & ni 
went it wasnot very hiegh just like others so when nex were clinb- 
ing it little to sandy we riched the top alright but whenwe wanted 
to come down why we couldn mole so we sat down and slide down 
in that way we couldn get down in that city where we were its too 
cold in summer sometimes its snowing but this little dty was full 
of trees and mountains. 

Sample E. Value 40 
An Exciting Experience 

One day it was very hot and we didn't know what to do. 

This was at our Gym. period, so we went over on the lawn and 
sat under a tree for a while in the shade. 

After a while one of the girls said, Let's play something." We 
all suggested that we would play ghost. We started and played 
for a long while and then we got tired and we sugested we would 
play something else so we played leap frog. 

We were awfully hot now so we sat down in the shade and 
rested ourselves. 

After a while one of the girls sugested that we would play 
statue. We had been playing awhile and it was my turn to be be 
swung around. The girl that was swinging had swung all the 
other girls and they were pretty heavy then she took me and 
swung me around fast not thinking of how light I was she let me 
go and I fell on my left wrist. I heard it crack and I thought it 
was broke, they took me down in the Gym and the Gym teacher 
bandaged it up. It was not broke, but it was sprained. 
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I bet there were fifty girls that asked me where I fell what I 
was doing an how it was done. Well that was the last game that 
I have played since then. 

Sample F. Value 50 
An Accident 

We were out at camp No. 133 which is sittuated near the banks 
of Deep River. One of the men that stayed at this camp owned 
a old duck boat which leaked and if you wanted to ride in it you 
would have to set a certain way ot it would fill with water and 
soon sink. 

My brother saw me paddaling aroimd in it and he decided that 
he would do it himself. 

He weighed about twenty-five lbs. more than me I told him 
the way to set in it but he would not listen but said that one end 
was as good as the other. 

He jumped in and sat down on the nearest end which was 
the wrong end and paddaled out into the river. He paddaled down 
the river for some distance and then turned around to come back. 
By this time the boat was nearly sinking and we saw him paddeling 
as fast as he could go to get back to the bank. 

But it was of no use the boat began to sink and he tried to get 
to the right end but in trying to get to the right end he upset the 
boat and had to swim with all of his clothes on. The water wasn't 
very cold and he swam all the way to up the bridge pushing the 
boat with him. He soon was in dry clothes and was none the 
worse for the accident. 

Sample G. Value 60 
On the Water 

While at a small lake not far from Gary my small brother had 
quite an exciting adventure. The lake is quite deep and though 
my brother can row a boat quite well we never allowed him to go 
out in a boat by himself. 

One day my brother asked mother if he might go down to the 
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beach. She replied, "Yes, but do not get into the boats." Clouds 
began to gather and it looked as though we would have a violent 
electric storm. 

It began to get dark and mother thought of Bud so she sent 
my sister to get him. After a few minutes she came back saying 
that Bud was not on the beach. 

We asked if any one had seen him but no one had. I thought 
that he might be playing with one of the little boys so I went to a 
house on the bluff expecting to find him there. 

While I was on the porch, which over looks the lake, it began to 
thunder & lighten & soon the rain came down in torrents. I stood 
looking out over the lake & and I noticed a boat sway at the other 
side of the lake. 

I ran to my father and told him what I had seen. He hurried 
to the beach & found that one of the boats was missing. Then he 
and my sister took a boat and in about twenty minutes they came 
back with a rather frightened & very much bedraggled little boy. 

He, my brother, showed great presence of mind, for when the 
storm began he was not frightened very much and tried to reach 
the shore. 

You may be sure that he did not dare to go out in a boat alone 
after that. He told us that he would learn to swim first. 



Sample H. Value 67 
My First Morning in Mexico 

We came on a train late at night and had a hard time finding 
an American hotel in San Louis Potesi. After finding one we went 
straight to bed, although my mother took plenty of time to lock 
and prop the door closed. The next morning I woke up early and 
looked out of the window. The first thing I saw was a rickity 
old closed up wagon coming down the street drawn by a few very 
small borros. This strange looking object was the morning street- 
car, the driver was standing in front blowing a small tin horn for 
the people to get out of the way. 

Some of the Peons were getting breakfast, the mother sat on the 
street baking tortellias and the family seated in a circle about her 
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eating as fast as she could bake. The funniest thing about the 
people eating was that the pigs and dogs ran about the outside 
of die circle eating the scraps that were thrown them. 

Down the street comes a man riding on so small a borro that 
his feet touch the ground, he is smoking a cigerette and lazily 
looking about. Behind him walks his wife holding the baby and 
hiting the borro her husband rides on to make it go Behind the 
wife come the children each carrying something. The houses 
along the street are made of adobe, all the windows have bars on 
them which make the house look more like a prison. Over the 
tops of the buildings can be seen the mountains which have nothing 
but the cenutry plant on them. Some borros are. 

Sample I. Value 80 

With a jar and a somewhat business like jolt the rickty 
elevator came to a stop on the basement floor. The door 
swung open and I stepped out into press-room of the Chicago 
Tribune. 

Surrounded by a mass of quiwering steel I was at a loss to know 
what to do. I was suddenly confronted by a bearded man clothed 
in ink smeared overall and jumper. A small tight shop cap was 
set jauntily on one side of his head and a pair of steel grey eyes 
peered at me thru a rather large pair silver rimmed glasses. He 
seemed to be saying something to me but the battery of Hoe 
presses had control of the field and it was only with the greatest 
difficulty that I could hear what he was trying to tell me. 

Beckoning me with an ink stained finger the pressman, for 
such was the position of this man, piloted me around, imder, and 
even over masses of quivering and roaring steel until we stopped 
before a press which stood two and one half stories high, a half 
city block in length and the same in width. The silent pressman 
paused for a moment to glance with pride at the roaring monsters 
when he motioned me again and mounting an iron stairway with a 
brass railing we were soon standing on the top deck of this master 
press. Two and a half stories below me sixteen large rolls of spot- 
tlessly white paper were swiftly unrolling into the press. Wheels 
within wheels whirled and sang and its very song seemed to say. 
"I am the Frank A Munsey the worlds largest newspaper press." 

N 
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The value to teachers in having their own scale even though 
it be roughly determined is as follows : First, thqr have a 
double check on their results by the use of the two scales ; 
second, a composition scale constructed from the material 
which is to be measured is of more value for purposes of com- 
parison; third, in the construction of such a composition 
scale, the teachers acquire a viewpoint and an intimate ac- 
quaintance with the measurement of written composition 
which they would not otherwise obtain. 

Other Tests. — The Extension of the Hillegas Scale for the 
Measurement of Quality in English Composition by Young 
People, by Dr. E. L. Thorndike, is a general merit scale in- 
tended to be used in grades 4 to 12 inclusive. It is made 
up of 29 compositions grouped according to their quaHty 
into 15 units with values ranging from o to 95. 

Some of the qualities have under them as many as 5 and 6 
specimens. On account of the larger munber of compositions 
and a system of marking similar to that which teachers 
ordinarily use, the scale can be used to a considerable 
advantage by the teacher. 

The Harvard'Newton Scales for the Measurement of English 
Composition are made up of four scales for eighth grade com- 
position, one for each of the four types of composition, 
narration, description, exposition, and argumentation. The 
scales were constructed by Dr. Frank W. Ballou with the aid 
of eighth grade teachers in Boston. The compositions were 
written by eighth grade students. Each scale is made up of 
six compositions with values ranging roughly from 40% to 
95%. An important feature of each scale is a notation of the 
merits and the defects of each composition and a comparison 
with the compositions above and below it. 

A Scale for Measuring the General Merit of English Com- 
position, by F. S. Breed and F. W. Frostic was constructed 
from compositions written by sixth grade pupils. It is a 
general merit scale. 
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A Puncttuitian Scale has been constructed by Dr. Daniel 
Starch for the purpose of determining a child's ability to 
punctuate. It is made up of a number of sentences arranged 
in a series of 10 steps which the child is to punctuate. These 
sentences increase in difficulty with each step. Tentative 
standards of attainment have been formulated for the seventh 
and eighth grades. 

A Copying Test was used by a group of Boston teachers to 
determine the degree of accuracy with which pupils copy. It 
is intended primarily for the grammar grades and the high 
school. Such errors are noted as occur in the following: 
Spelling, capitalization, omitted words, and added words. 
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CHAPTER VII 

THE MEASUREMENT OF DRAWING 

How is Drawing Now Measured? — To the teacher of 
artistic temperament who objects that accurate grading of 
work in drawing is impossible, that much work is dependent 
upon sentiment, is intangible, and should be left unmeasured 
so far as the pupil is concerned, the answer is that measure- 
ment in drawing is constantly being made through " grades " 
or " marks '' which are given to children. Grades ranging 
from ioo% down to 60% or from A to F are given year after 
year, term after term, and in some school systems month 
after month. The effort, therefore, to construct a scale that 
will make possible more accurate measurement of drawing 
should be welcomed by every teacher of drawing. Notwith- 
standing millions of measurements or marks each year, there 
are few teachers who themselves have any adequate standard 
for grading which recognizes definite differences in abiUty 
between the pupils of one grade and another, or which makes 
possible any accurate comparison of the drawing work among 
buildings. The old marking or grading system .^as given us 
no accurate method for measuring products in drawing. 
Pupils themselves protest again and again against grades 
received that are based upon the mere whim of the teacher. 
Or perchance the teacher was out late the night before, or 
had a slight case of indigestion, and the pupil's grades are cut 
accordingly. This is but another case of a badly felt need 
for objective standards in grading. 

The Scale and Some Uses of It. — In 1913, Dr. E. L. Thorn- 
dike constructed a tentative scale for the iwe^^va^TCvK^ <^ 
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general merit in drawing. The use of such a scale in 
schoolroom is not only advantageous to the teacher, su] 
visor, and superintendent, but it is appreciated by the pi 
as an instrument of fairness and democracy. With a dei 
scale available for comparisons, pupils themselves will arrii 
after some practice, at reasonable accuracy in judgment 
they will be able to see clearly their own progress from; 
lower to a higher stage of ability. Dr. Thorndike's s( 
while tentative and while making no claim for perfectic 
does quite surely make possible the comparison of a pupi 
work at one time with his work at another time. It pei 
comparison with reasonable accuracy between the work 
one pupil and another pupil or between one room or grot 
of children and another. It permits comparison of work 
one city with the work in another city. It enables supervis 
or superintendents to determine the relative results from o! 
hundred minutes per week in one part of the city over agaii 
fifty minutes per week in another part of the city and 
makes it possible to determine the time allotment for drai 
on a reasonably intelligent basis. It should permit 
comparison of one method of instruction with another wh( 
conditions are properly arranged and conclusions propel 
evaluated. None of these results were possible except in 
very general and indefinite way under the old marl 
system. Under the old system if one hundred pupils in 
entire city had a grade of 95% or above, and if such pupils w( 
scattered in as many different rooms of the city, no 01 
could infer that these one hundred pupils had the same abilil 
or proficiency in drawing. On the other hand if as many 
four pupils in the sixth grade of a large city system reached 
proficiency equal to merit 17 of the Thorndike scale therj 
would be little doubt on the part of any one familiar with 
system that these pupils were fairly equal in ability and thalpJ 
they showed ability very much above the average. In fact 
merit 17 is such a degree of excellence as would be reached 
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by only one pupil out of 50,000, ages 6 to 15, and to discover 
a few pupils in a large city system who had superior ability 
can readily be recognized as information of unusual value. 

Requirements of a Scale. — A scale in drawing as in any 
other subject should permit accurate measurement of differ- 
ences in degree of merit. To weigh out sugar in five, ten, or 
fifteen-pound packages is comparatively simple as the method 
of weighing has been perfected to a very high degree. No such 
degree of accuracy should be expected from the use of the 
drawing scale. The samples of merit on the scale as arranged 
cannot be handled in the same definite and accurate manner 
as the pound weight which the grocer places upon the left- 
hand side of the scale to balance a pound of sugar placed on 
the right-hand side, but a Uttle practice will enable the 
teacher to rank drawings with a fair degree of accuracy. 
This is the proper function of a scale. As the teacher 
becomes more familiar with the scale, and increases her 
experience in its use she will acquire more and more the 
weight-accuracy secured in handling sugar, but it is reason- 
able to expect that she will never quite reach sugar-weighing 
accuracy. 

An explanation of the unit of the Thomdike scale for draw- 
ing will make clear that absolute accuracy is hardly to be 
expected. The formation of the scale is built upon the basis 
of equal differences in judgment and the unit chosen for this 
particular scale is : the difference of merit in children's drawings 
which 75% of artists, teachers of art, and intelligent judges 
generally can distinguish, and which 25% of them fail to dis- 
tinguish. It will be observed that the unit itself allows for 
25% error in judgment by those who are competent to judge. 
If the teacher makes mistakes in equal ratio, one in every 
four, the teacher still improves her judging by the use of the 
scale. It should be noted, however, that the units in the 
Thomdike scale are sufl&ciently large that the error in placing 
a drawing is seldom as great as a full step and that the usual 
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variation of a judge does not exceed half a merit. Dr. 
Thorndike has explained this point as follows : " If the same 
judge should' so rate a thousand drawings, and then, putting 
these ratings aside, rate the thousand over again he would 
vary often by more than half a * merit ' from his previous 
judgments.'' This means that for a high degree of accuracy 
a teacher should rate drawings several times and then take 
the average, or in a contest of unusual importance, the draw- 
ing should be rated by several judges and the average of 
the judgments taken. The necessity of a large number of 
rankings is greatly reduced by the use of a drawing scale 
This was demonstrated by an actual test. Ten teachers 
measured the merit of a drawing by the use of the scale and 
varied only 4 points all told. Ten other teachers measured 
the merit of the same drawing without the scale. They 
were instructed to grade the drawings from o to 17. They 
showed a variation of 14 points, or nearly 4 times as much 
variation without the scale as with it. 

How the Scale was Derived. — The scale as derived by Dr. 
Thorndike is based upon a preliminary study of 45 drawings 
and a more intensive study of 15 drawings. The 15 drawings 
were submitted to 376 judges, 60 of them being artists of 
sufficient distinction to be listed in " Who's Who in America," 
80 being supervisors or teachers of art, and 236 being students 
of education and psychology. The judges were asked to 
rank the 15 drawings according to merit. These 376 judg- 
ments of merit permitted Dr. Thorndike to determine the 
scale steps on the basis of differences in judgment of merit. 
The table which follows herewith summarizes the judgments 
with reference to the relative merits of the 14 drawings finally 
used in the scale. 

With this table as a basis and with a unit defined as the 
difference of merit recognized by 75% of the judges, Dr. 
Thorndike was able by the use of statistical methods to 
assign a definite value to each of the 14 drawings of the scale. 
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Table 46 

94.85 per cent of the judges rated h as better than a 

84.5 per cent of the judges rated c as better than h 

88.45 P^r cent of the judges rated d as better than c 

69.5 per cent of the judges rated e as better than d 

82.55 per cent of the judges rated/ as better than e 

69.7 per cent of the judges rated g as better than/ 

89.4 per cent of the judges rated h as better than g 
81.75 per cent of the judges rated i as better than h 
70. per cent of the judges rated j as better than i 
73.35 per cent of the judges rated k as better than y 

72.5 per cent of the judges rated / as better than k 
86.5 per cent of the judges rated m as better than / 
74.2 per cent of the judges rated n as better than m 

It is scarcely to be assumed that a unit in one part of the 
scale is exactly equal to a unit in another part of the scale. 
It is, however, safe to assume that the scale is entirely on a 
ranking basis, that is, that the drawings are arranged in the 
order of merit. It is also safe to assume that the units are 
approximately the same in one part of the scale as in another 
and that progress of one unit in any part of the scale is 
approximately equal to similar progress in another part of 
the scale. Dr. Thorndike gives warning, however, that the 
exact numeral relations of the scale should not be used too 
freely in scientific quantitative studies of achievement and 
improvement in drawing. For the practical purposes of 
measuring drawing work in the pubUc schools, however, no 
harm will result from taking the scale at its face value. 

While it is unnecessary to attempt to understand in detail 
Dr. Thorndike's method of determining the values of the 
drawings in the scale, an idea of the procedure may be easily 
grasped by considering the extremes of the situation.^ If 
drawing x were judged as better than drawing y by 
exactly 50% of a large group of competent judges, it is 

^ Students interested in fully understanding the procedure are referred to 
Thoradike's "Mental and Social Measurements," Chap. VIII. 
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apparent that just as many judges thought x better than 
y as thought y better than x. If the judges were suf- 
ficiently competent and nimierous, we are justified under 
the conditions in assuming that drawings x and y are 
equal in merit. On the other hand if ioo% of a large 
group of competent judges ranked drawing y as better 
than X we would only know that y was " vastly superior " 
or " so superior as to be in an entirely different class." 
Now, manifestly, there are all grades of difference rang- 
ing from a 50-50 judgment of equality to a o-ioo judg- 
ment of superiority, and some place in between these two 
would be found the degree of superiority that is recognized 
by only 75% of the judges. This is the unit of merit of the 
Thorndike Drawing Scale. By calling the first drawing 
(drawing a) in the scale zero^ and remembering from the table 
above that 94.85% of the judges rated drawing b better 
than drawing a, the mathematics of the case gives 2.4 as the 
merit value of b. In like manner the other values of the 
scale were determined, — 3.9, 5.7, 6.5, 7.8, 8.6, 10.5, 11.8, 
12.6, 13.5, 14.4, 16.0, and 17.0. The last two steps in the 
scale are almost exactly i merit apart, as n was judged better 
than m by 74.2% of the judges. So if m has a value of 16.0, 
n should have the value 17.0. It thus appears that such an 
intangible situation as differences in judgment may be made 
the basis of mathematical calculation for the formation of a 
definitely evaluated scale. 

Limitations of the Scale. — The chief limitation of the 
scale is that it is based upon general merit in drawing and is, 
therefore, not analytical of different t)rpes of drawing. In 
fact the scale combines various types of drawing into a single 
scale and thus relates to comparisons of materials that in 
many respects are not comparable. In time we should have 
scales developed that will deal with merit in particular lines 
rather than general merit. For instance the following possible 
unit scales will be recognized by a teacher of art and drawing : 
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a. Pencil drawings. 

b. Charcoal work. 

c. Sepia originals. 

d. Sepia reproductions. 

e. Water color. 

f . Mechanical drawings. 

g. Maps and informational designs. 

The teacher may even make divisions along entirely diJBferent 
lines, resulting in such divisions as decorative, photographic, 
informational, representative, etc. ; or even along such simple 
lines as types of subjects, as, trees, buildings, fruit, animals, etc. 
The t)rpe of art work carried on in different cities varies 
considerably. The supervisor of art and drawing will doubt- 
less find it especially helpful to use the Thorndike drawing 
scale for grading her work until she can supplement it by 
other scales, made by ranking and evaluating the work of her 
own pupils. Scales resulting in this way will be more easily 
understood by grade teachers and pupils, and they will also 
be more fully appreciated. The most useful scale for lower 
grade work will doubtless be a scale of merit for products in 
water color. Comparison with the Thorndike drawing 
scale will be the easiest and simplest method of procedure in 
making such a scale. At first the supervisor may have diffi- 
culty in filling in all of the steps on the scale. It may take 
the larger part of a year to complete this scale and others 
needed to cover the various phases of art work undertaken in 
the particular city, but all the while she will be collecting 
samples, comparing, rejecting, replacing, and getting her own 
ideas more and more definite. The result will be that by the 
time such scales are completed their value will be apparent 
and their usefulness unquestioned. Scales thus made up 
from the work of the pupils in a school system will be more 
useful in many respects and will accomplish some purposes 
not possible with the Thorndike scale, even thovx^k 1\^.^ -^^ 
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much less accurate and not at all scientifically constructed. 
They will result in constant reference to the Thomdike scale 
and in a much better understanding of it. 

With reference to the extension and refinement of his own 
drawing scale, Dr. Thorndike makes the following conmient : 
" These limitations can be remedied in part, though we may 
not expect to measure the merit of a child's drawing as easily 
and accurately as the weight of his body. The scale can be 
extended in scope to include specimens of maps, mechanical 
drawings, decorative drawings and designs of various sorts, 
drawings of specified objects, and the like ; and with proper 
methods of investigation and enough labor, specimens can be 
found of these several sorts of drawings of exactly i * merit,' 
i| ' merits,' 2 ' merits,' and so on up to 17 ' merits,' which 
is as far as a scale for children's drawings needs to extend. 
The labor involved is, however, very, very great. In order 
to get one specimen proved to be between .99 and i.oi, it 
would probably be necessary to collect with great care at 
least fifty drawings by very stupid children, to have them 
measured by at least 100 judges in comparison with the scale 
as it now is, then to select 20 of them to be so measured by 
200 more judges, and then to select ten of these to be measured 
by 300 more judges, and finally to have the two or three or 
four of these that were between .99 and i.oi, by the opinion 
of the 600 judgments so far, measured by 400 more judges. 
If the present scale were not at hand as a basis, the labor would 
be much greater. The improvement of the scale in these 
respects must then be a gradual achievement of several 
years." 

Grade Standards. — The fixing of grade standards in 
drawing or the various phases of art work will gradually 
follow an extensive use of scales with definitely evaluated 
units. The first step in fixing grade standards will be to 
agree upon a set of rules or instructions for taking samples of 
the work of pupils. These instructions will doubtless need 
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to indicate a range of subjects or problems within the ability 
of the various grades tested and then fix time Umits for the 
tests. However, this is only a guess as to how the instructions 
should be drawn. It may be that there should be no time 
limit, or that the best work of a pupil for a month or a term 
should be taken for grading. The art supervisor who works 
out satisfactory instructions and fixes grade standards will 
be performing a worth while educational service. 

Childs reports ^ a preliminary use of the Thorndike scale 
during the winter of 1914. A total of 2177 pupils in two 
school systems was involved in the test. For the purposes 
of the test, a new scale was constructed by including all the 
hiunan figure samples of the original Thorndike scale, and 
adding snow scene samples, thus making a scale with a total 
of 16 human figure and snow scene samples. In administer- 
ing the test, a time limit of ten minutes was observed and the 
subject was assigned as follows : " Scene or picture with snow 
on the ground and boys or girls doing something, as snow- 
balling, coasting," etc. 

The following median scores resulted from the test : 



Table 47 



Grade. . 


iB 


lA 


2B 


2A 


3B 


3A 


4B 


4A 




Median . 


S.5(?) 


6.1 


5-9 


7-5 


7-9 


7.6 


7.5 


8.3 




Grade. . 


5B 


SA 


6B 


6A 


7B 


7A 


8B 


8A 


All 


Median 


8.2 


8.4 


8.5 


9.7 


8.7 


9.3 


S.8 


9.5 


8.09 



Childs thinks that the above medians should not be taken 
as norms for grade performance, but that norms, if based upon 
this study, would be by grade groups as follows : 

^ See Bibliography at close of chapter. 
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Table 48 





Group 


Grade 


Characteristic Score 


I . . 




iB 


4.S(?), 5.5 


n . 




1A-2B 


5.5, 6.5 


Ill . 




2A-4B 


7.2, 8.0 


IV 




4A-6B 


8.0| 8.6 


V . 




6A-8A 


8.6, 9.3 



This study, while brief and preliminary in nature, confirms 
Dr. Thorndike's suggestion that differentiated scales are 
desirable. The " human figure, snow scene scale '' used in 
this study was much more satisfactory than the Thorndike 
general scale. The study also confirms the findings of Barnes, 
Ludens, Burk, and Gotze, that children show a plateau of 
non-development in drawing from the age of nine or ten on to 
adolescence. It leads, therefore, to the question as to whether 
the time schedule in drawing should not be greatly reduced 
for most children from ages 10 to 14. There is little doubt 
but that extensive use of objective standards for grading 
products in drawing would do for that subject what it has 
done for other subjects. It would lead to a better selection 
of subject matter, improved methods of instruction, a revision 
of time schedules, and provision for handling children accord- 
ing to capacity and needs. 

Using the Thorndike Scale. — The drawing scale is used in 
the same manner as the writing scale. The specimen of the 
pupil's work is moved back and forth on the scale until the 
scale specimen to which it corresponds most nearly in general 
merit is found. This gives the value of the pupil's work in 
terms of the scale, and the scale value should be placed as 
the " grade " on the pupil's work, if a " grade " is desired. 
For example, if the pupil's drawing corresponds in general 
merit to the sixth specimen on the scale, merit 7.8, then the 
pupil's grade is 7.8. 
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When first beginning the use of the scale there wiU be a 
tendency to place the sample on the scale according to the 
kind of drawing, rather than general merit. This is a diffi- 
culty which is soon overcome by practice in the use of the 
drawing scale just as it is in the use of the writing scale. The 
scale is constructed on the basis of general merit, and the 
specimen is to be located on the scale on the basis of general 
merit regardless of the kind of drawing. Pupils have diffi- 
culty on this point when the scale is first used, but it is sur- 
prising how quickly pupils as well as teachers advance to the 
stage of recognizing general merit in the use of the scale. 
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CHAPTER VIII 

THE MEASUREMENT OF OTHER GRADE SUBJECTS 

The tool subjects of the grades are being measured with 
success and with beneficial results on teaching and curricula 
making. Can the content subjects — such as history, geog- 
raphy, physiology, literature, nature study, and elementary 
science — be measured with equal success and equally bene- 
ficial results? The answer is that many attempts are being 
made, that success has not been attained, and that final 
success is still in doubt. A scientific test or scale for grading 
a subject is merely a reasonable examination which has been 
carefully graded and evaluated, i,e, standardized. Any fixed 
or rigid examination scheme tends always to formalize the 
teaching of a subject. For the formal phases of the tool 
subjects this is desirable, assuming good teachers and provision 
for adequate motive. But can we formalize the teaching of a 
content subject without undesirable results, or can we apply 
standard tests to the more formal informational phases of 
such a subject without its resulting in misplaced emphasis by 
many teachers, a large majority of them? It is very doubtful. 
At any rate, it remains an open question. 

Standardized tests ^ have been devised in United States 
history, geography, language, and grammar. 

History. — At least seven standardized tests are available 
in United States history. Five of these test information 
only ; namely, those by Bell and McCuUum, Harlan, Starch, 
Davis, and Raynor. One test, by Van Wagenen, is devised 

^ See Bibliography at the close of the chapter for references and list of 
available tests. 
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to test thought and character- judgment as well as information. 
Barr is working on a set of diagnostic tests in United States 
history, covering information, thought, reasoning, and judg- 
ment. The form of the history test varies. Harlan, Starch, 
and Raynor use the completion test form. Every test has 
made an effort to make the questions simple and definite so 
as to secure answers that can be graded as either right or 
wrong, thus simplifying and standardizing the grading. In 
the tests dealing with thought, reasoning, or judgment, this 
can be done to a limited extent only. 

The Bell and McCullum test is one of the first devised for 
testing history and is a good illustration of the informational 
type. It was first made available in 191 7, but apparently it 
has not been extensively used. The test consists of seven 
parts, as follows : 

I. Give the reason for the historic importance of each of ten 
representative dates (Dates — Events). II. Indicate for what 
each of ten prominent men was celebrated (Men — Events). 
III. Mention the name of the man prominently connected with 
each of ten historic events (Events — Men). IV. Define in a 
short sentence each of ten historic terms (Historic Terms). 
V. Make a list of all the political parties that have arisen in the 
United States since the Revolution, and state one principle advo- 
cated by each (Political Parties). VI. Indicate the great divisions 
or epochs of United States history (Divisions of History). VH. On 
an outhne map of the United States (supplied) draw the land 
boundaries of the United States at the close of the Revolution, 
and indicate the different acquisitions of territory since that 
date (Map Study). The questions were as follows : 



I. Dates — Events. 


(Four 


minutes.) 






I. 1861. 






6. 


1619. 


2. 1789. 






7- 


1783. 


3. 1620. 






8. 


1492. 


4. 1565- 






9- 


1776. 


5. 1898. 






10. 


1846, 
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II. Men — Events. (Five minutes.) 

1. John Burgoyne. 

2. Alexander Hamilton. 

3. Jejfferson Davis. 

4. Walter Raleigh. 

5. John C. Calhoim. 

6. Cyrus H. McCormick. 

7. George Dewey. 

8. Sam Houston. 

9. Roger Williams. 
10. James Oglethorpe. 

III. Events — Men. (Three minutes.) 

1. Captured Quebec during French and Indian War. 

2. Discovered the North Pole. 

3. Wrote the Declaration of Independence. 

4. Invented the telephone. 

5. Brought about the Missouri Compromise. 

6. Captured the City of Mexico during the Mexican War. 

7. Founded the Colony of Maryland. 

8. Made a great speedi against the English Stamp Tax. 

9. Was President of the United States during the Civil 

War. 
10. Vetoed the re-chartering of the United States Bank. 

IV. Historic Terms. (Seven minutes.) 

1. Second Continental Congress. 

2. Lewis and Clark Expedition. 

3. Articles of Confederation. 

4. Sherman Anti-trust Law. 

5. Monroe Doctrine. 

6. Fugitive Slave Law. 

7. Dred Scott Decision. 

8. Alien and Sedition Laws. 

9. Nullification Ordinance of South Carolina. 
10. Emancipation Proclamation. 

V. Political Parties. (Five minutes.) 

VI. Divisions of United States History. (Five minutes.) 

VII. Map Study. (Five minutes.) 
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The tests are easily administered. There should be 
further specific directions as to scoring the separate questions. 
The tests were originally given and standardized on the basis 
of the answers of students selected from the Texas normal 
schools and the University of Texas. There were 523 students 
from grades six and seven, 668 high school students, 207 normal 
school students, and 75 students from the University of Texas. 
No attempt has been made to fix grade standards. The test 
was used originally in order to study the question, " What will a 
carefully constructed information test in United States History 
reveal regarding individual, sex, and school differences? '' 

Doubtless the most valuable purpose that can be served 
by this scale is that of the study of the effectiveness of various 
methods in fixing traditional facts in the minds of the children. 
The test is one of the most valuable of existing tests on the old 
type of history, which has for its end the mastery of the facts 
in the traditional course. It is doubtful if the test affords 
even a comprehensive review of old chronological history or if 
the details of each test are well selected. Under the Dates — 
Men test, 1846 would not be selected as an important date 
outside of Texas. It is doubtful if 1565 is one of the important 
dates in United States history. Under the Men — Events 
test, it is very doubtful if the ten most important men in our 
history are mentioned. There is a tendency, throughout the 
entire series of tests, to place as much emphasis on the earlier 
phases of United States History as upon the later. An 
examination of these tests gives rise to the question as to 
whether or not they will perform any desirable service in the 
hands of teachers for examination purposes. The same 
question is properly raised with reference to the Harlan tests, 
the Starch tests, the Davis tests, and the Raynor tests. 
They seem to miss the fundamental purpose of the review or 
examination as an instrument of teaching, and their tendency 
is to place emphasis upon the phases of history that are less 
important. 
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Fig. II 
VAN WAGENEN AMERICAN HISTORY SCALES 

Information Scale A 

Name Sex Grade Sch^l 

When was your last birthday? How old were you? Date. 



I. What people did 
Columbus find in Amer- 
ica? 


2. Name any American 
general. 


3. In what did the In- 
dians live? 


4. Who was President 
of the United States dur- 
ing the Civil War? 


5. By what people was 
our Thanksgiving Day 
custom started? 


6. With what country 
did the United States have 
war in 1898? 


7. Name any man be- 
sides Columbus who made 
early explorations in 
America. 


8. In honor of what 
event do we celebrate the 
Fourth of July? 


9. What were the two 
chief occupations of the 
Indian men? 


10. Arrange these events in the order in which they occurred by putting a "i" 
before the event that occurred first, a " 2 " before the event that occurred second, 
and so on until you have put a "5 " before the event that occurred last. 

Struggle between the French and the English for control in America. 

Rise and growth of the United States as a nation. 

Discovery of America. 

Settlement of America by European nations. 

Struggle of the American colonies against European control. 


II. In what war was 
the battle of Gettysburg 
fought? 

The battle of Trenton? 
The battle of Lake Erie ? 


12. What was Henry 
Hudson looking for when 
he sailed up the Hudson 
river? 


14. What were the first 
four European countries 
to make settlements in 
America? 


13. Who was President 
of the United States when 
Louisiana was purchased ? 


15. Who was the Brit- 
ish general in each of these 
battles : 

Battle of Saratoga? 
Battle of Yorktown? 


16. During what war 
did iron war vessels first 
come into use? 


18. What important 
means of communication 
were invented and put 
into use between 1835 and 
1845? 

Between 1870 and 1880? 

Between 1895 and 1910? 


17. What group of In- 
dian tribes Uved in the 
western part of New York 
State? 
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The Van Wagenen tests ^ are referred to as scales. There 
is an information scale, a thought scale, and a character 
judging scale. It is doubtful if the term scale is properly- 
applied. They will be referred to here as tests. The in- 
formation test is more extensive than the Bell and McCullimi 
test. It consists of 32 questions, some of which have several 
parts that are practically equal to additional questions. 
Figure 11 shows a section of the scale, including the first 18 
questions. It is quite evident that the author, in these 
questions, is attempting to get a good sampling of the in- 
dividual responses of the pupils on history information. 
How fully and to what advantage they can be used by the 
individual teachers yet remains to be seen. For the superin- 
tendent who desires comparison among schools or teachers, 
or for the educational expert, who desires to survey an entire 
school system, they will afford comparisons which should 
form the basis of valuable inferences. The effects upon the 
curriculum of frequent uses of the test will need to be watched 
carefully and properly guarded. 

The thought test is significant in that it recognizes the im- 
portance of thought or content considerations in the study of 
history. It is at least to be commended as a first attempt at 
attacking this difficult phase of history work. Questions 
1,2,3, 7, 13, 19, and 22, which follow herewith, are illustrative 
of the questions in the thought test. 

1. Before the steamboats were made people used to travel on 
the ocean in sailboats. Steamboats were not made until a long, 
long time after the European people came to make their homes in 
America. 

How do you think these early European settlers came to 
America? 

2. A little before the year 1500 the people of Europe were 
anxious to find a new way to get to India. Some people thought 
that India might be reached by sailing westward across the At- 

^ See Bibliography at close of chapter. 
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lantic Ocean. Columbus was one of these people. It was at this 
time that Columbus foimd America. 

What do you think Columbus was looking for when he found 
America? 

3. A hundred years ago it took a letter several days to go from 
New York to Boston. To-day it takes only a few hours. 

Why do you think it took letters so much longer to go from 
New York to Boston 100 years ago than it does to-day ? 

7. In 1829-30, it took over 160 hours of work to raise 50 bushels 
of wheat ; in 1895-96, it took less than seven and a half hours of 
work to raise the same amoimt. 

How can you account for the difference ? 

13. In 1660, the English Parliament passed the restrictions 
that certain colonial products, called enumerated* articles, includ- 
ing sugar, tobacco, dyewoods and indigo, should be shipped from 
America only to England or to other English colonies. 

In 1663, an act of Parliament provided that all goods brought 
to the colonies must come from or through English ports. 

What do you think was the purpose of the English in thus seek- 
ing to regulate the trade of the colonies? 

19. At the outbreak of the Civil War there were comparatively 
few factories for spinning and weaving of doth in tie South. 
They could no longer get doth from the North and the Northern 
blockade shut it out from England. Besides they had little ma- 
chinery and no means of making machinery for spinning and 
weaving. 

In such a crisis how do you think the people of the South ob- 
tained the doth necessary for dothing? 

22. At the dose of the Revolutionary War many of the people 
in America were driven from their homes by oflSdal acts of a new 
state government, their property was taken, and they were de- 
prived of the right to vote or to hold public offices. 

How can you account for such action? 

No attempt will be made to give a critical evaluation of this 
test. That must be determined by its more extensive use. 
It may be noted in passing, however, that 13 of the 22 ques- 
tions in the thought scale relate to history preceding 181 2. 



The Measurement of Other Grade Subjects 199 

Studies which have been made of the relative importance of 
facts in American history as indicated by present social usage 
clearly point to the fact that the author of this test has 
greatly misplaced the emphasis. In fact, there is little 
evidence throughout the entire test that the author has in 
mind that history can be used in solving present-day 
problems. 

The character-judging test consists of fifteen questions 
dealing respectively with the following topics : (i) white man's 
response to Indian treachery, (2) Nathaniel Hale, (3) John 
Quincy Adams's refusal to remove a political opponent from 
office, (4) John Quincy Adams and the right of petition, (5) 
an Indian father's love for his son, (6) Fletcher and the Earl 
of Belmont as governors of the New York Province, 1692-1698, 
(7) EngKsh Colonial soldiers, against the Indians in Massachu- 
setts, 1724, (8) Secretary Stanton's behavior in tearing up a 
decree from President Johnson, (9) Indian Warfare, (10) Indian 
Warfare, (11) Indian Warfare, (12) Parliamentary retort, 

(13) St. Clair and Butler against the Northwestern Indians, 

(14) Political prejudice, (15) Difference between Lieut. Derby 
and Secretary of War Davis during President Pierce's adminis- 
tration. 

It will be observed that 7 of these 15 questions deal with 
Indians or Indian warfare in some form. One deals with 
colonial government, at least two with the question of poKtical 
prejudice, and the latest date of any of the events is the one 
referring to Secretary Stanton during the administration of 
President Johnson. In view of this analysis, one may properly 
doubt the adequacy of the questions for testing character- 
judgment in history, particularly on a basis of present 
utiKty. The characters are too far removed. The appeal 
is not in any case strongly motivated. The examination, 
therefore, with this set of questions is sure to be largely a 
formal matter so far as the children, or even the teacher, are 
concerned. 
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Questions i, 8, and 12 are quoted herewith. 

I. In 1772, there was a frontier wedding. The guests had come 
from many miles. After a night of rough merriment and dancing 
the guests lay down to sleep under the roof of their host or in the 
near-by barns and sheds. When morning came two of their horses 
were missing. Not doubting that they had strayed away, three of 
the young men started out to find them. Soon several gunshots 
were heard and the three young men did* not return. Believing 
that it was a small scalping party of Indians eight or ten more 
mounted the horses that stood saddled before the house and 
galloped across the fields in the direction of the firing; while 
others ran to cut off the enemy's retreat. 

Draw a line under the three of the following words which you 
think best describe the action of these white men. 

indifferent cowardly cautious polite brave 
courageous spiteful fearful daring timid 

8. General Grant had been very positive in demanding that 
all oflScers of the Confederate army should enjoy their Uberty. 
Among those who had been imprisoned by order of the Secretary 
of War, Edwin M. Stanton, was General Clement C. Clay, an ex- 
United States senator from Alabama. He was taken ill in prison 
with asthma, and his wife came to Washington to solicit his re- 
lease. She went to President Johnson, and he gave her the neces- 
sary order, which she took back to Secretary Stanton. Stanton 
read the order, and, looking her in the face, tore it up without a 
word and pitched it into his waste-basket. The lady arose and 
retired without speaking ; nor did Stanton speak to her. 

Draw a line under the three of the following words which you 
think best describe this action of Secretary Stanton. 

cautious tactful callous generous courteous 

thoughtful sympathetic rude insolent considerate 

12. General Smyth was remarkable for long, prosy, intermina- 
ble speeches in the House of Representatives. On one occasion, 
in the committee of the whole, after having wearied the patience 
of the members more than usual, he said to Mr. Clay, who sat 
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near him, in a low voice, while he was pausing for a new start, 
"You speak for the present generation ; I speak for posterity," — 
"Yes," replied Mr. Clay, "and you seem resolved to continue 
speaking till your audience arrives." 

Draw a line under the three of the following words which you 
think best describe this action of Henry Clay. 

kind bitter sarcastic generous cautious 

hmnorous ignoble abusive sympathetic ready-witted 

The makers of history tests, thus far, have failed to com- 
prehend the true purpose of a review or examination lesson. 
Such a lesson should give a new view, should secure a re- 
organization of subject matter, and should provide for use or 
application. It should, in the best sense of the word, re- 
enforce the true purposes of the original teaching of the 
subject. In the case of history, if it is to serve the purpose oi 
civic eflBiciency on the basis of social utility, much greater 
emphasis must be placed upon the organization of the material 
in the form of large motivated problems which look forward 
to present-day applications. The time spent on the modern 
periods of history must be greatly increased. The provision 
for carrying over and applying to present-day problems 
must be made on a more adequate basis. A review or exami- 
nation lesson should serve all of these purposes. To date, no 
test has been standardized which does accomplish these pur- 
poses. If standard tests are to be of service in history, they 
must be so constructed as to eflEect the desirable basic aims 
and outcomes of the subject. When this purpose has been 
served, the other subsidiary purposes, such as distribution of 
the children according to ability, a diagnosis of classroom 
results, and a more accurate grading of pupils, will follow. 

Diagnostic Tests in History. — Dr. Truman L. Kelly, 
in an experimental study of the analysis and prediction of 
ability of high school pupils,^ has included a history test. 
This has not been developed and used sufficiently to indicate 

^ See Bibliography at the close of the chaptec. 
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its value, but there is, in this use of a test in history, a sugges- 
tion of possibilities which needs fiurther attention. A test 
which is used merely to discover ability, in order to properly 
advise students to continue further work in the line, or to dis- 
cover lack of ability, in order to advise students to dis- 
continue work, — this is a use of the test which is less likely 
to formalize a content subject and which, when properly 
understood, has connected with it no undesirable results. 

Geography. — The tests available in geography at the 
present time are tests of the formal informational phases of 
the subject. The Starch tests ^ cover the elements of five 
geography texts, and have been arranged in five parallel tests 
of equal value. They are in the form of mutilated sentences. 
They are of limited value for the reasons given in the 
introductory paragraphs of this chapter and under the dis- 
cussion of history. 

The Hahn-Lackey Geography Scale. — The Hahn-Lackey 
geography scale ^ is an illustration of the application of 
scientific procedure on an extensive plan, the result being a 
scale involving both fact and thought questions developed 
on the plan of the Ayres spelling scale. The scale consists 
of about 200 questions, graded for difficulty for the fourth, 
fifth, sixth, seventh, and eighth grades. The questions are 
based upon textbooks and cover the conmion subject matter 
of six recent texts. The scale, with complete instructions for 
grading each question, can be secured from the authors. 
Figure 12 shows typical data from the scale. A careful study 
of this section will doubtless convince any reader that the 
larger purposes of geography will not be furthered by the use 
of the test. While an attempt is made to involve thought 
questions, it will be seen that even these deal chiefly with 
fact or informational phases of the subject. It is doubtful, 
therefore, if the general use of this scale will be an advantage 
in any school or school system. 

^ See Bibliography at the close of the chapter. 
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Fig. 12. — A Scale for Measuring Ability of Children in Geography in 

Grades 4, 5, 6, 7, and 8 



Grade 


G 


'4 


1 


6 


4 


6 


6 


7 


12 


8 


12 



307. Name three 
agencies or pro- 
cesses at work mak- 
ing rocks into soil. 

215. By what 
states would you 
pass in going by 
boat from Cincin- 
nati to Memphis? 

160. Why is most 
of the rainfall of 
Australia limited 
to the eastern and 
south-eastern 
coasts? 
162. Much of 
India receives from 
12 to 16 inches of 
rainfall in July and 
less than 1 inch in 
January. Explain. 

225. Which is 
the greater distance 
and why, 30 degrees 
west of Washington, 
or 80 degrees south 
of Washington? 

216. New Orleans 
is in 80 degrees 
North Latitude and 
St. Louis is in 39 
degrees North Lati- 
tude. They are in 
the same Longi- 
tude. About how 
far apart are they 
in miles? 



12 



21 



21 



IIS. Name two im- 
portant valleys of the 
United States near 
the Pacific coast. 

127. Why is min- 
ing an important 
business in the Ap- 
palachian region? 

217. Name five 
natural wonders of 
the United States. 



114. Why is New 
York so importaxA 
as a dairying state? 

132. Why doesn't 
California grow 

much com? 

172. Why is the 
Trans-Siberian rail- 
road of so much im- 
portance to Russia ? 



173. Why is the 
Niger river of less 
imi)ortance than the 
NUe? 

187. Give one 
reason why Chicago 
rather than St. Louis 
has become the rail- 
road center of the 
middle west. 



58 



78 



79 



52. What is the 
largest city of your 
state? 



64. Where is 

Alaska and to whom 
does it belong? 



84. Name four 
large cities of 
Europe. 



92. Give the capi- 
tals of France and 
Germany. 

1 01. Name two 
large bodies of water 
that border on 
Florida. 

45. Name four 
things you use for 
food that do not 
grow where you live. 



68. Give one 

reason why so many 
of the great cities of 
the United States are 
near the sea-coast. 

72. Which is the 
coldest and which 
the warmest part of 
South America? 



73 



84 



88 



94 



94 



63. What country 
is north of the 
United States and 
to whom does it be- 
long? 

102. Name two 
other countries in 
North America be- 
sides the United 
States. 

24. Name five 
wild animals. 



5. What two 
oceans border on the 
United States? 

43. Name a plant 
used for making 
cloth. f 

49. Write your 
whole address. 

27. Name two 
kinds of work that 
men do in getting 
materials for build- 
ing houses. 

26. Name two 
kinds of work that 
men do in getting 
food for us. 

84. How can you 
tell from what di- 
rection the wind is 
blowing? 

50. To whom do 
the streets or roads 
belong ? 
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The Boston Tests in Geography. — These tests were 
prepared under the guidance of an educator who had given 
careful consideration to the true aims of geography in the 
schools. The result is that the two tests, one on the geography 
of the United States, and the other on the geography of 
Europe, consist of questions well chosen from the thought 
standpoint, and questions that are likely to have an influence 
entirely in the right direction in the teaching of the subject. 
While the tests have never been fully standardized and are 
not available, they are of such significance in showing develop- 
ment in the right direction, that it will be worth while to 
describe them. This can be done in the words of the author.^ 

The test was prepared with a view of ascertaining : 

(Ja) The character of the geographical knowledge of the pupils 

tested ; 
{h) The ability of the pupils tested to reason from geographical 

data; 
{c) The relative adequacy of their knowledge of the general 

geographical features of the United States and Europe ; 

and 
[d) Whether scientific measurement of educational results in 

geography is possible. 

The Scope of the Test Which Was Given 

It is obvious that a forty-five-minute test can cover only a 
limited field of geography. Therefore, the test was confined to 
the most important countries of the world, viz. the United States 
and the countries in Europe. Although these countries are studied 
chiefly in the fifth and sixth grades, by no means does it follow 
that simply fifth and sixth grade work was tested. The study of 
Europe and Canada in the sixth grade should certainly include the 
review of many essential features of the geography of the United 
States. In the seventh grade the work with Asia and Africa should 
involve not a little review of both the United States and Europe. 

^ See Bibliography at close of the chapter. 
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Indeed, the makers of a course of study cannot be justified in 
devoting so much time to Asia and Africa as is the case in our 
present course, unless such study requires full explanation of the 
relationship existing between these countries and the more pro- 
gressive countries of the world. Through the study of such re- 
lationship, there is obtained a definite review of many important 
facts and principles of the geography of the United States and 
Europe. 

Aims of Geography Teaching 

As is well known, the conception of geography teaching to-day 
is quite different from that of fifty or even twenty-five years ago. 
Then the study of the subject consisted largely in memorizing 
definitions, in learning the location of places, and in learning un- 
related facts about the different countries of the world. 

At the present time we consider that the value of geography 
lies not so much in a knowledge of facts concerning the earth and 
its people as in an understanding of the various ways in which 
man's activities are influenced by physical environment. 

As a result of the study of geography in the elementary school 
the pupil should gain : 

1. An abiding interest in the different peoples of the world, 

their industries, their achievements, and their relations to 
ourselves. 

2. A mastery of geographic facts and principles suflicient to 

enable him to explain : 

(a) The growth of the leading cities of a region. 

(b) The development of important industries. 

(c) The dependence of one part of the world upon another. 

3. A breadth of mind which will lead to a sympathetic under- 

standing of races and nations other than his own. 

4. A working knowledge of the subject by a thorough training 

in the use of maps, texts, and reference books so that he 
can work out new problems independently. 

In short, geography should help the pupil to interpret his en- 
vironment, which in the case of civilized man reaches out to all 
parts of the world. 
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Questions on United States 

(An outline map of the United States was printed at the head of 
the questions.) 

I. Locate on the map the cities named at the right : 



2. In the column marked 
"products," write opposite the 
name of each city the name of 
a product for which the city is 
noted. 



Cities 
MinneapoUs 
Pittsburgh . 
Lowell . . 
New Orleans 
Duluth . . 
Galveston . 
Lynn . . 



Products 



3. Give reasons for the growth of Minneapolis. 

4. Below is given a list of articles which we use in our homes. 
Write below each word the name of the state in which that article 
is produced in large quantities : 



cotton 



oranges 



cane sugar 



nee 



coal 



iron 



5. Write on the map the name of each state which you have 
just written in answering Question 4. 

6. Why do the states just east of the Rocky Moimtains receive 
less rain than Massachusetts ? 

7. Explain the way in which the flood plains of the Mississippi 
River have been formed. 



Questions on Europe 

(An outline map of Europe was printed at the head of the 
questions.) 

1. Locate on the map two seaports of European Russia. 

2. Why are the seaports of Russia not so important as the sea- 
ports of England? 

3. Of what value to the countries of Europe are their colonies 
in other parts of the world? 
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4. Why does England import large quantities of wheat? 

5. Write on the map the names of the leading manufacturing 
countries of Europe. 

6. Why has Germany become very important as a manufac- 
turing country? 

7. Why is the climate of Italy different from that of Germany? 

The results of the test show that it is possible to ascertain 
by carefully selected tests whether or not the true aims of 
geography have been accomplished in the teaching. It is 
evident that pupils may remember locational facts without 
being able to use these in any adequate way in answering the 
questions which occur to one in daily life. This means that 
locational facts should be properly subordinated to other more 
vital phases of the subject. The close relationship between 
questions i and 2 in the test on the United States shows the 
correct method of fixing in mind the location of places through 
the study of facts which make those places worth remembering. 
The important consideration is not the locational facts, but 
the reasons behind them. There is little or no value in 
knowing the location of places to which no significance is 
attached. 

While it is possible that the standardizing of these tests 
would in time have worked harmful results in the Boston 
schools, they do indicate the type of questions that should 
form the basis of examinations or tests in geography. 

Language Tests. — Tests devised by Starch^ and Charters^ 
are available for testing the language errors of children. The 
Starch tests are designated " Grammatical Scale A," " Gram- 
matical Scale B,'' ** Grammatical Scale C." The same t)T)e 
of material is contained in each scale, although they are not 
guaranteed to be of equal value. The use of the three tests, 
however, would give a more accurate measure of the child's 

^ " Educational Measurements," Chap. 7. 
' See Bibliography at close of the chapter. 
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language ability. The tests are built on the plan of choice 
of words. They are illustrated by the following form (Scale 
A, Step 5, Sentence 2). 

" The Gazette reported (he; him) to be dead." 
The pupil is to mark out the incorrect expression, leaving 
the correct one. The chief criticism of the Starch tests ^ 
is that they have not been devised on the basis of an extensive 
study of the errors actually made by children. For instance, 
in Scale A, there are thirty-seven errors. Eighteen of these 
do not appear in the Connersville, Boise, and Kansas City 
studies of the common errors of grade children. Of the other 
19, there are 2 on the double negative; 2, sequence of tenses; 
3, use of shall and will; and 7, choice between the objective 
and nominative form of the pronoun. In view of the in- 
adequacy of these tests, it seems unnecessary to go into detail 
with reference to the scoring or use of the tests. Quite surely, 
however, the teacher will find in these tests some helpful 
suggestions. Tentative standards are given by the author, 
making possible comparison. 

The Charters^ Langtcage Test covers only pronouns. It is 
based upon a careful study of the pronoun mistakes of school 
children, as indicated by the collection and study of more 
than 25,000 errors that pupils made in using pronouns in 
their oral language. The test consists of sentences, such as : 
" 8. Who do you want? 

" 16. They made baskets and filled it with holly." 
in which there are errors in the use of a pronoun. The pupil 
is required to write the correct form. The test is designed to 
be used in grades three to eight, and to measure the pupil's 
ability to use the correct forms of pronouns. The test is of 
the right t)^e and is based upon a fundamental study so 
that there is in it no misplaced emphasis. The teacher will 
need to use a score sheet for her pupils in order to locate the 
specific errors made by each pupil. 

1 They should be designated as "language tests," not "granunatical scales.'* 



The Measurement of Other Grade Subjects 209 

Grammar. — Standard tests in grammar ^ have been 
devised by Starch and Charters, and one is in preparation by 
Buckingham. The Starch grammatical tests have for 
their purpose the measuring of grammatical knowledge — 
Test I, parts of speech; Test 2, cases; Test 3, tenses and 
modes. These tests are of very doubtful value for use in 
the grades. The studies by Hoyt and Briggs ^ have shown 
that it is undesirable to teach formal grammar in the grades. 
The Charters test in grammar is the same as the language 
test, except that the pupil is required to give the grammatical 
rules upon which the corrections are based. It is doubtful 
if the giving of such rules is a desirable practice in the grades. 
It is not recommended, therefore, that the grammatical 
section of the Charters test be used. 

Music. — Seashore ^ has devised a prognostic test of musical 
talent, which has been perfected, and which should prove of 
unusual value. For a nimiber of years Dr. Seashore has been 
refining this test. The test discovers musical sensitivity, 
musical memory and imagination, musical intellect, and 
musical feeUng, with an accuracy which justifies a definite 
conclusion with reference to the musical talent of the pupil 
tested. As knowledge of this test becomes available, there 
is no doubt that it will be used more and more widely in 
discovering for further education the students of unusual 
ability, as well as those who have so little ability that it is 
useless for them to put further time on the study of music. 

BIBLIOGRAPHY 

1. Rugg, Earle Underwood, "Character and Value of Standardized 

Tests in History," School RevieWj 27: 757-771, December, 1919. 
An unusually helpful critical evaluation of present tests in history. 

2. Bell, J. Carleton, and McCullum, D. F., "A Study of the Attain- 

ment of Pupils in United States History," Journal of Ediicational 
Psychology y 8: 257-274, May, 191 7. 

^ See Bibliography at close of the chapter, 
p 



2IO How to Measure 

3. KeDy, Truman L., "Educational Guidance. An £iq)erimental 

Study in the Analysis and Prediction of Ability of Ffigh School 
Pupils," Teachers College, Columbia University, Contributions 
to Education, No. 71, p. 33. 

4. Buckingham, B. R., "Correlation between Ability to Think and 

Ability to Remember, with Special Reference to United States 
History," School and Society 5 : 443-449, April 14, 191 7. 

5. "Van Wagenen's Scales in United States History." Information, 

thought, and character judgment scales. Bureau of Publica- 
tions, Teachers College, Columbia University, New York City. 
A. S. Barr of 19 South La Salle Street, Chicago, is working towards 
a series of tests in United States History similar to the Van Wage- 
nen tests. 

6. "Starch's American History Test, Series A." Address, University 

of Wisconsin, Madison, Wisconsin. 

7. "Davis' Tests in United States History — Colonial History." Ad- 

dress, S. B. Davis, University of Pittsbiu-gh, Pittsbiu-gh, Pennsyl- 
vania. 

8. " Raynor's American History Tests." Address, Biu-eau of Edu- 

cational Research, University of Illinois, Urbana, Illinois. 

9. "Harlan Test of Information in American History." Address, 

Bureau of Cooperative Research, University of Minnesota, 
Minneapolis, Minnesota. 

10. Ballou, Frank W., "Geography; A Report on a Preliminary At- 
tempt to Measiu-e Some Educational Results," Boston School 
Document No. 14, 191 5. Tests not available. 

ri. Branom, M. E., and Reavis, W. C, "Completion Test for Measure- 
ment of Minimal Geographic Knowledge of Elementary School 
Children," Seventeenth Yearbook of the National Society for the 
Study of Education, Part I, pp. 27-39. 

12. "Hahn-Lackey Geography Scale." Address, H. H. Hahn, Wayne 

State Normal School, Wayne, Nebraska. Price, $.07 per 
copy. 

13. "Starch Geography Test." Price, $.02 per sheet. Address, The 

University Supply Association, Madison, Wisconsin. 

14. "Charters Diagnostic Test in Language and Grammar for Pro- 

nouns." Address, Bureau of Educational Research, University 
of Illinois, Urbana, Illinois. Price and postage : Language Edi- 
tion, $.60 per hundred copies ; Grammar Edition, $.90 per hun- 
dred copies. 

15. Hoyt, Franklin S., "Studies in English Grammar," Teachers College 

Record i November, 1906. 



The Measurement of Other Grade Subjects 211 

16. Briggs, Thomas H., "Formal English Grammar as a Discipline," 

Teachers College Record, September, 1913. 

17. Starch, Daniel, "Educational Measurements." Chapter 7 on 

English Grammar. The Macmillan Co. 

18. Seashore, C. E., "The Psychology of Musical Talent," Silver, Bur- 

dett & Co., Boston. 

19. Seashore, C. E., "Vocational Guidance in Music," University of 

Iowa, 1 9 16, First Series, No. 2. 

20. Seashore, C. E., "Avocational Guidance in Music," Journal oj 

Applied Psychology, i : 342-348, 191 7. 

21. Seashore, C. E., "The Measurement of a Singer," Science, 35 : 201- 

212, 1912. 



CHAPTER IX 

THE MEASUREMENT OF HIGH SCHOOL SUBJECTS 

It is possible that some day high school pupils will be so 
distributed, on the basis of ability, and so managed, by teachers 
who are thoroughly competent and sympathetic, that formal 
examinations will disappear entirely from the high school, 
or will appear no longer as formal examinations, but merely 
as review recitations, thus forming a part of good teaching 
rather than a part of an examination scheme. As long, 
however, as more or less formal examinations continue to be 
given, teachers and educators must continue in efforts to 
reduce the evils of the examination system by educating 
teachers in the matter of giving and grading examinations 
and by seeking in every way possible to standardize the tests 
which of necessity must be given. The standardized test, in 
almost every case, is more reasonable, shows fewer idiosyn- 
crasies, and shows more uniformity in grading, than do the 
ordinary unstandardized examinations. Apparently, there- 
fore, in secondary work, two movements must continue to 
develop — one looking toward the gradual elimination of the 
formal examination, the other looking forward to the standard- 
ization of such tests as are given. 

High School Tests. — Can the more strictly high school 
subjects, such as Latin, French, algebra, geometry, physics, 
ancient history, chemistry, general science, literature, 
composition, the commercial subjects, agriculture, home 
economics, and manual training, be measured by standard 
tests and scales? The answer is, " Yes, in so far as they are 
tool subjects or mechanical subjects." Tests have been 

212 
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developed and more or less fully standardized in algebra, 
geometry, physics, Latin, German, French, ancient his- 
tory, and commercial subjects. It is true that, with the 
development of the junior high school, the line between 
high school and grade subjects is not so clearly defined as it 
once was, so that in the junior high school, and even in the 
senior high school for that matter, some grade tests are 
being used in diagnosing the condition of the child with 
reference to the fundamental subjects or determining his 
efficiency for certain lines of work, particularly the commercial 
work. For such purposes, therefore, grade tests, such as 
diagnostic tests in arithmetic, reading for speed and accuracy, 
writing, spelling, and composition, are used to a considerable 
extent. 

Assuming that the new entrance age for secondary work is 
twelve or the arrival of adolescence, then we may expect 
that these tests will be used more and more for determining 
the condition of the child with reference to the mastery of the 
fundamentals of grade work. 

Why Tests Have Developed More Slowly in High School 
Work. — The work of developing standard tests in the 
secondary subjects has proceeded more slowly than in the 
elementary subjects. Since tests are developed chiefly by 
educational experts in colleges and universities, and since 
such experts are as interested in the secondary schools as in 
the elementary schools, in fact more interested from the 
standpoint of training teachers, it appears that the reason 
why standard tests and scales haye not been developed for 
high school subjects so rapidly as in the elementary subjects 
must be due to some intrinsic values or limitations involved 
in the subjects themselves. 

The reasons are: First, most of the high school subjects 
are not tool subjects. They are of value chiefly because of 
content and appreciative values. These values are more 
intangible, more difficult to measure; than the simple elements 
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involved in the tool subjects. Second, the old academic view- 
point that secondary work is merely preparatory is changing. 
The old viewpoint made the mastery of subject matter, as 
such, the essential consideration. The present tendency, 
however, is to minimize the importance of high school work 
as merely preparatory, to look more towards use and appli- 
cation, and to make of the high school a real people's school 
serving the broader aims of education. The efficiency of 
work on this basis cannot be tested nearly so well by exami- 
nation methods. Even a subject like mathematics does not 
become a tool subject for a large percentage of pupils. 
Apparently, therefore, appreciative values and an under- 
standing of the subject from the standpoint of enjoyment 
and perspective are just as important as the mere mastery of 
subject matter. Third, it is in subjects like literature and 
history especially that the fact, subject matter basis is par- 
ticularly undesirable. Literature, to be effective and to carry 
over into later life, must be taught on a basis of appreciation 
and enjoyment. It does not lend itself to rigid testing. 
History, likewise, deals with hfe problems, which depend 
for their development upon present-day problems, pupil 
interests, community contacts, and teaching equipment. 
So that any attempt to reduce history to a mere mechanical 
basis renders it of little value. 

In short, standard tests and scales have proven of value 
chiefly in measuring the tool subjects and the mastery of 
subject matter. The high school curriculum has many other 
values, some of which are possibly even more important than 
the strictly measurable ones. It will be worth while, how- 
ever, to note the development of scales in high school subjects 
in so far as they have developed. 

Algebra. — It is generally considered that algebra in its 
more fundamental processes is a tool subject which may be 
measured with a degree of accuracy approximating the simpler 
fundamental processes in arithmetic. The attempts to 
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formulate scales or standard tests in algebra have proceeded 
with greater rapidity than any other high school subject. 
Among the tests available are : the Monroe " Standardized 
Research Tests in Algebra," the Rugg and Clark " Stand- 
ardized Tests in First Year Algebra," and the Hotz " First 
Year Algebra Scales." 

1. The Monroe Tests are based upon the assumption that 
the equation is the central fact of algebra. The tests cover 
the simple operations as follows: Test i, removal of paren- 
theses; test 2, clearing the equation of fractions; test 3, 
solving for x^ a special case of division ; test 4, transposition ; 
test 5, collecting terms ; test 6, solution of simple equations. 

2. The Rugg and Clark Standardized Tests in First Year 
Algebra have been very generally used. They have been quite 
fully standardized. They will be found of interest to teachers 
of algebra, and, if properly used, will be of considerable value. 

The chief criticism of the tests is that they attempt to 
cover the subject as fully as it is covered by textbooks and, 
on the whole, seem a httle more difficult than necessary. 
Even so, they will be found much simpler and much more 
reasonable than tests which teachers ordinarily give. This 
will be apparent from an examination of a few of the tests. 

Test 4, in simple equations, follows herewith. 

1. 2:^-5-3 = 11 Answer 

2. 4c = 6 c+12 Answer 

3' 5^~3="~2o Answer 

4. 13 = 2:^—8 Answer 

5. 12 a: — 7 — 15 :it: = io Answer 

6. 3 x+4 = 16 Answer 

7. 8^ = 15^+14 Answer 

8. 7:^—6= —29 Answer 

9. i7=3:it:--8 Answer 

10. 9 :r--8--i3 :r = 7 Answer 

11. 5:^+2 = 27 Answer 

12. 6t = gt+2i Answer 
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13. Srr — 7=— 35 Answer, 

14. 19 = 5:^—9 Answer. 

15. 21 jc — 12 — 26 jc = i4 Answer. 

16. 4:^+5 = 17 Answer. 

17. 115 = 135+20 Answer. 

18. 9:r— 6=— 40 Answer. 

19. 23 = 6 jc — 9 Answer . 

20. 15 :r — II— 21 :it: = i8 Answer. 

21. 6:^+3=33 Answer. 

22. i2y = i^y+i2 Answer. 

23. lojc — 7=--33 Answer. 

24. 21 =8:^ — 5 Answer. 

25. i8:r--3 — 23 :it: = 9 Answer. 



• • • • 



Number attempted 
Number right 



Test number 5, parentheses, consists of 42 examples, as follows: 



I 
2 

3 

4 

5 
6 

7 
8 

9 
10 

II 

12 

13 
14 

15 
16 

17 

18 

19 
20 

21 



6(3 :r+8) Answer. 

5(4^^ — 2) Answer. 

— 3(4 rr — 2) Answer . 

—4(30;— 4) Answer. 

9( — 7 a; — i) Answer. 

— 8(--4:r — 7) Answer. 

8(5 a; +4) Answer. 

7(4 ^—3) Answer. 

— 5 (6 jc + 7) Answer . 
— 6(4 jc — 8) Answer . 
7( — 8 rr — 3) Answer . 
— 9(— 3^—6) Answer. 
7(4:^+6) Answer. 
6(3^ — 5) Answer. 
—4(7 x+$) Answer. 

— 5(6 jc — 7) Answer. 
7(— 9:^ — 2) Answer, 
— 9( — 5^— 4) Answer . 
5(6:it:+7) Answer. 
7(— 9:^ — 2) Answer. 

— 5(3^+8) Answer. 
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22. —7(8 x — s) Answer 

23. 8(— 6 :r— 3) Answer 

24. — 6( — 5 x—4) Answer 

25. 8(7 x+s) Answer 

26. 5(3 x—6) Answer 

27. —4(6 x+^) Answer 

28. —5(7 x—4) Answer 

29. 9(— 6 x — 2) Answer 

30. — 7(— 5 x—^) Answer 

31. 4(3 x+y) Answer 

32. 6(7 rr— 3) Answer 

33' -5(2 ^+8) Answer , . . . 

34. —4(3 x — y) Answer 

35. 8( — 7 x—^) Answer 

36. — 7(— 3 x—$) Answer 

37' 7(5 ^+9) Answer 

38. 5(8 x—4) Answer 

39. —6(5 x+2) Answer 

40. —9(4 x—6) Answer 

41. 7(— 6 rr— 4) Answer 

42. — 8(— 4 x—6) Answer 

Number attempted 

Number right 

Test number 8, factoring, contains 25 examples, as folio 

1. 5 o(?+is x? Answer 

2. a^— 64 Answer. 

3. y^—6 y+g Answer 

4. ft^+ii ft+28 Answer 

5. 5 0^+16 jc+3 Answer 

6. 6 a^+9 a? Answer 

7. o^ — it Answer 

8. ^ — 16 /+64 Answer 

9. a^+12 a+27 Answer 

10. 9 rc^+36 :it:+32 Answer 

11. gy^—t'f Answer 

12. h^ — 2$ Answer 

13. a^ — 14 a+49 Answer 



• • 



2i8 How to Measure 

14. m^+12 w+35 Answer 

15. 7 0^+26 :r+i5 Answer 

16. ?>x^ — i2 0(^ Answer 

17. 3;^ —81 Answer 

18. />2-.i8 ^^_8i Answer 

19. c^+ii c+30 Answer. 

20. 3 rc2+22 JC+35 Answer 

21. 12 3^+18 y* Answer 

22. ^^ — 2$ Answer 

23. r^ — io:r+25 Answer 

24. a^+io a+21 Answer 

25. 5 i!t;^+28 :r+i5 Answer 

Number attempted 

Nimiber right 

The time allowance for test 4, is 3 minutes, for test 5, 
2 minutes, and for test 8, 4 minutes. Time limits have been 
fixed for each test and tentative standards have been deter- 
mined. For example, the average of 27 schools on test 5 
was 10.4 examples attempted with 9.7 correct. 

3. The Eotz First Year Algebra Scales have been worked 
out on a scientific basis and the problems located on a point 
scale. This is an advantage to the extent that teachers 
understand its value. The problems differ from those in a 
standard test in that they are of increasing difficulty. The 
pupil is measured by the point which he reaches on the scale. 
This requires the scale to grow sufficiently difficult that he 
will at some point fail to go farther. The test in multi- 
plication and division, series B, which follows herewith, is 
illustrative. 

Multiplication and Division 

Carefully perform the operations as indicated. Reduce all 

answers to their simplest forms. 

12 n 
I. 3 • 7y= 2. = 
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3. 2a-4a62= 4. 6c^-^2c^ = 

5. fotgm=^ 6. — = 

4 or 

7. 4 ^ • (3 ^/) = 8. a^ . (-3 a) . (-2 a) = 

18 m^n — 27 ww^ 4 o:^ „ 

9. ^^ = 10. - — -^ 2 ar = 

9 mn 5 

II. (2 ^2+7 a— 9)(5a — 1)= 12. — — ^ = 

^2-3 

7a 7a2_ -12 r^/ • (^-2) _ 

15 20 3r^y5 

15. •— ^= 16. (-3^>'^)' = 

(^ — d^ c — d o 1 3 

17. r . = 18. ^ rri • 4 rri = 



19. 
20. 



a+2 



21. 



^^+27 _^ 3jH-9_ 
iJC^+rx; — 12 :r+4 

22. 641X27^ = 

3V6a 

23. ^ — —z. • 12 a = 

2aVi8 

There is no doubt that any of these tests in algebra will 
prove of value. They have been standardized, they permit 
comparison, they will be valuable for research purposes, 
and they have the advantage when used by teachers for 
promotion purposes of avoiding the unusually difficult prob- 
lems often used by teachers in final examinations. In 
other words they are more reasonable than tests usually 
given by teachers. Teachers frequently have erroneous 
ideas about the promotion of children; some even think 
it to their credit to fail a large number of pupils. If a pupil 
can pass simple tests such as the Monroe tests, he should be 
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permitted to go forward with advanced work. While the 
Monroe tests cover only the simple fundamental processes 
of first year algebra, the Rugg and Clark tests cover the entire 
field of secondary algebra. The tests will distribute pupils 
so as to show a teacher that she is instructing a group of 
pupils who differ widely in abihty and need help and drill on 
widely varying details. The wise teacher of algebra will 
keep for every pupil a card showing his mistakes or weak- 
nesses, such as mistakes in sign, errors in copying, errors in 
factoring, etc. The standard tests will further aid in locating 
pupils' weaknesses. Every teacher of mathematics in a 
high school and every superintendent should become familiar 
with at least one of the available tests in algebra. 

Geometry. — The Stockard and Bell ^ test in geometry 
consists of 70 questions arranged in 20 groups. In devising 
the test " the attempt was made to call for information that 
is to be found in all standard textbooks ; to test for important 
and fundamental principles of geometry ; to provide such a 
range of questions as to be representative of the whole field of 
elementary geometry, and to include memory facts, knowl- 
edge of content, organization of subject matter, and partic- 
ularly abihty to do originals ; and to confine the list to such 
dimensions that every question could be tried by the average 
high school pupil in 40 minutes." 

The 20 groups " involve drawing figures, naming figures, 
indicating order of development in demonstrations, complet- 
ing statements, stating the converse, definitions, regular 
polygons, parts of a demonstration, angular relations, area 
of a trapezoid, angles in polygons, angles in circles, con- 
gruency, similarity of triangles, loci, auxiliary lines, simple 
constructions, ratio and proportion, algebraic expression of 
geometric relations, and equivalent construction." 

It is evident that the authors have attempted to measure 
quite fully the student's mastery of the subject matter of 

^ See Bibliography at close of chapter. 
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elementary geometry. The test was given to 372 school 
students who had completed a year's work in geometry. 
About one third of the pupils tested were able to attempt all 
of the questions. On the basis of the tests given, the different 
questions are rated. The authors think the test not very 
practical for general high school use. It is too lengthy and, 
on the whole, a Uttle too difficult. Teachers may use it, 
however, for diagnostic purposes or purposes of research. 

Diagnostic Tests in Mattiematics. — Six tests have been 
selected by Anna L. Rogers.^ The tests, together with the 
time for explaining and giving the same, are as follows : 

(i) Algebraic Computation 12 minutes 

(2) Interpolation 18 minutes 

(3) Geometry 40 minutes 

(4) Superposition 7 minutes 

(5) Mixed Relations 8 minutes 

(6) Trabue Scales, L and J 12 minutes 

Total 97 minutes 

The total time required, 97 minutes, is just a little more 
than 2 regular high school periods. Yet in that brief time 
these tests enable a competent teacher to diagnose the mathe- 
matical abihty of ninth grade pupils with a view to improve- 
ment in the classification of students in the high school by 
eUminating from the mathematics classes those unfit for 
further mathematical training, and selecting those capable 
of progressing at a more rapid rate than the majority. The 
tests also serve to discover particular hnes of mathematical 
weakness. They may be given in the seventh and eighth 
grades, but, when so given, the time limits must be considerably 
extended. These tests are " designed to measure the more 
important phases of mathematical capacity demanded by 
high school mathematics, and, in particular, the ability to 
manipulate numerical and algebraic symbols, the ability to 

^ See Bibliography at close of chapter. 
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grasp and handle spacial relations, and the ability to deal 
effectively with words. They are of such a nature as to enable 
an inteUigent teacher to form an independent estimate of 
the pupil's mathematical capacity and likelihood of success 
in further lines of mathematical work. They measure 
original ability rather than effect of training." 

Miss Rogers has given directions for giving the tests and 
evaluating the scores, and has fixed tentative standards. 
She says, " As tentative standards, we suggest: (i) Where a 
pupil's score is greater than 150, he has capacity to progress at 
a more rapid rate than the ordinary high school student. 
(2) Where a pupil's score is less than —150, he shows inca- 
pacity to progress in mathematics at the rate of the ordinary 
high school student and, other things being equal, should be 
released from further training in the subject." The group 
coming between —150 and +150 is considered the normal 
group in high school mathematics. 

Henmon's Latin Tests. — Prof. V. A. C. Henmon of the 
University of Wisconsin has developed a series of vocabulary 
tests. A, B, C, and D, which are of equal difficulty and in 
which the words are arranged in order of difficulty. All of 
the words in these tests have been carefully evaluated. A 
series of sentence tests is also available, consisting of tests 
I and 2, of equal difficulty, and test 3, in which the sentences 
are all of approximately the same difficulty. Standard 
scores are given for both the vocabulary and the sentence 
tests. One test is required for each pupil. 

The Henmon tests in Latin have several advantages over 
ordinary tests given by teachers. In the first place, they are 
scientifically constructed, and they are based upon vocabulary 
which is common to Caesar, Cicero, Virgil, and 13 of the 
most frequently used first year Latin texts. In the second 
place, the tests are thoroughly standardized, making possible 
accurate grading and comparison with other schools, with 
other classes, or among pupils of the same class. In the third 
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place, such tests are helpful in the study and analysis of 
class work. This is well illustrated in the overlapping of 
abilities in successive years as shown by these tests. In test 
A the medians for the first year range, for different classes, 
from I to II, the median for all first year classes being 4. 
In the second year the class medians range from 4 to 19, the 
median for the year being 7. In the third year the class 
medians range from 3 to 25, the median for the year being 
20. If corresponding medians for various schools show such 
wide variation, the individual scores must evidently show 
very much greater range. The overlapping of abilities in 
Latin is thus seen to be comparable to the overlapping of 
abilities in other subjects. 

The administering of standard tests of this kind should 
discover the pupils of exceptional interest and ability on the 
one hand, and pupils on the other hand who, through lack of 
ability or interests, do so poorly that it is useless to have 
them continue the study. 

In so far as Latin is a tool subject, standard tests are 
applicable. In so far as the subject is of interest chiefly be- 
cause of other values, to that extent the teacher should be 
cautious in using standard tests or should use them only for 
her own enlightenment, being careful that they do not 
formalize her work. This will mean that the results of the 
tests are in general not brought to the attention of the pupils. 

Physics. — A physics test has been devised by Professor 
Daniel Starch, University of Wisconsin. It consists of 75 
mutilated sentences. They cover the 102 facts, principles, 
and laws of physics which the author has determined upon 
as the most essential. The basis for the determination was 
an examination of 5 widely used textbooks. The tests are 
easily administered. The value of the tests, however, has 
not been demonstrated. It is doubtful if physics can be 
reduced to facts in such formal fashion as this test would 
suggest. 
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Commercial Tests. — In the volume, "Commercial 
Tests and How to Use Them/' Sherwin Cody has brought 
together a summary of the use of tests in determining the 
relative standing of students graduating from the commercial 
departments of high schools. Tests are available, covering 
the following subjects : 

(i) Tabulating, mental alertness. 

(2) Reproducing instructions — designed to test memory and 
natural industry. 

(3) Invoicing. 

(4) Fundamentals of arithmetic, an adaptation of the Courtis 
tests. 

(5) Business arithmetic, including fractions, trade extensions, 
and percentage. 

(6) English, including spelling, elementary language, ad- 
vanced language, elementary punctuation, and advanced punctua- 
tion. 

(7) Letter writing. 

(8) Answering letters. 

(9) Stenographic tests covering transcribing and typewriter 
copying. 

(10) Copying for the mimeograph. 

(11) Addressing envelopes with the pen and filing. 

For each of the above lines of testing there are duplicate 
tests, full directions for administering the tests, keys for grad- 
ing, and tentative standards. The need of such conmiercial 
tests is evident to those who have attempted to select steno- 
graphic help or to evaluate the products of various commercial 
schools. The test in transcribing, for instance, is quite defi- 
nite. The student is expected to transcribe a standard 
business letter of 300 words in 5 minutes. This means 
60 words per minute. In many high schools, where there 
is no particular standard, students are permitted to graduate 
with a speed in transcribing of only 30 words per minute 
or even less. In like manner, the student can be accurately 
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checked on ability to file, answer a letter, spell, or use the 
fundamental processes in arithmetic. One of the main values 
of Mr. Cody's work is to suggest standards for commercial 
work, and all admit that this is a type of work which can be 
standardized. 

Sackett's Ancient History Scale. — Some will doubt the 
value of this test because it attempts to reduce a thought 
subject to a mechanical basis. Professor Sackett has at 
least shown the diflSculty of formulating an ancient history 
scale. His work is handled on an approved scientific basis, 
but if the premises are faulty the conclusion, of course, can- 
not be other than erroneous. What we want our students 
to get from ancient history is not a memory mastery of the 
facts, but, instead, an appreciation of the problems and the 
development of a method for the solution, not only of the 
problems of ancient history, but of present-day history. 

The scale (so called) consists of eight tests, each containing 
ten points. Test No. i, which is typical, is as follows : 

For what are the following men noted ? 

(i) Hannibal 

(2) Khufu or Cheops 

(3) Demosthenes 

(4) Darius 

(5) Solon 

(6) Charlemagne 

(7) Attila 

(8) Constantine 

(9) Mithridates 

(10) Justinian 

It is evident that this is a fact testing scale. Its general 
acceptance would quite surely be detrimental to the proper 
teaching of history. A student might have all knowledge 
as tested by this scale, and still be entirely lacking in the 
spirit and method of history. The test should be used with 

Q 
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extreme caution in order that it may not formalize the work. 
The caution should be even greater than in the case of the 
Latin tests referred to above, for history is in no sense a tool 
subject. The test should be used only for study and research 
purposes. The following illustration will make clear the 
meaning of research as here used : 

A teacher who had been teaching ancient history by out- 
line, endeavoring to master all of the facts in the book, was 
paired with another teacher who had the modem notion of 
motivation and problem organization of material, and the 
further notion that history was of value in so far as it had a 
bearing on the solution of particular problems at the present 
day. These teachers were encouraged to teach the same sub- 
ject in accordance with their respective ideas. At the end 
of the course comparison was made on the basis of the ques- 
tions used by the first teacher in former examinations. The 
results were entirely in favor of the second teacher. Possibly 
a fairer test and one even more favorable to the second 
teacher would have been a scientifically constructed stand- 
ardized test. On the other hand, if such standardized tests 
had been known to the second teacher in the beginning, it 
would doubtless have resulted in very much poorer work on 
her part because her work would have been formalized and a 
large part of the life taken out of it. At any rate, there is 
the possibiUty of real harm in using a formal test on a thought 
subject. 
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CHAPTER X 

THE MEASUREMENT OF GENERAL INTELLIGENCE 

" This is an exceptionally slow class," and " I have such a 
large number of children this year who are very slow," are 
expressions which the supervisor frequently hears as he passes 
from one classroom to another. These statements are based 
on actual facts in that they describe the condition of many 
children who are not making progress. The problem is a 
very real one to the teacher. 

It is also not infrequent to hear a teacher say, " I have an 
exceptionally bright class this year," or " I have4or 5 children 
who are far ahead of the others." It is unusual, however, 
to hear such statements followed with the remark, " I think 
certain children in this grade ought to be advanced to another 
grade." The latter situation should become more common. 

In either case the problem is one of knowing the child's 
mental age or general intelligence. He has been assigned to 
a certain grade chiefly on the basis of his chronological age 
or the number of years in school. He has been asked to do 
a certain type of work because children of his age who are 
normal are supposed to be able to do it. It may be that such 
children are being held back when they should be advanced, 
or the subject matter which is being presented in the regular 
class is not suited to them, and they are, therefore, in need 
of a different kind of subject matter presented in a special 
class. 

Suflftcient information is available to show that general 
intelligence tests can be used to determine the mental a^es. c^t 
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children, and that this information can be used as a basis for 
reclassification of the pupils. In a city school system in 
which the Binet-Simon (old form) Test was being used, it 
was foimd that a great many children who were too old chrono- 
logically were also too old mentally for their grade, e,g, when 
children enter the first grade at the age of six, they are accord- 
ing to correct practice of normal age chronologically if they are 
6 or 7 years old in the first grade, 7 and 8 years old in the 
second grade, etc. If a child in the first grade is 8 years old, 
he is I year too old chronologically. If he tests 8 years old 
mentally he is also i year too old mentally. He should, 
therefore, be in the second or third grade, provided his school 
life has been normal. If not, he should receive the special 
attention that will place him with his appropriate group as 
soon as possible. In the school system referred to above, out 
of less than scx5 children tested in 4 schools, 60 such children 
were found. All of these children were advanced one grade 
and all were able to continue with a good class standing in 
the grade to which they were promoted. 

In a small Iowa city 39 children out of a total of 177 chil- 
dren in 4 schoolrooms were advanced in the fall of 191 5, on a 
basis of a good class standing and being too old for the grade 
in which they were working. These children were selected 
from grades 5, 6, 7, and 8. Eleven of them were advanced 
one half of a grade, 28 were advanced a full grade. At the 
end of the first semester, February, 1916, not a single pupil 
failed in gaining promotion, and only 3 received a class stand- 
ing of less than 85%. The fact that such a large number of 
children in a school system can be advanced to advantage on 
a very general measurement, and that so little material is 
available showing the results of such practice, is indicative that 
a general intelligence test which will measure in very definite 
terms the general intelligence of large groups of children is 
greatly needed, and can be used to prevent waste and un- 
satisfactory results in classroom practice. 
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General intelligence tests have also been used for a number 
of years to select children who are subnormal to be placed 
in special classes in which they will receive a different type 
of instruction. Before such tests were available these 

children were com- Fig. 13. — grades mou Which 170 Childmn 

pelled to try to do no. ^^'^'^ ''^"^'' ''"' Mehtal Tests 

the same work as 
the normal child, 
which resulted in 
failure. After 
numerous repeti- 
tions had occurred 
it was realized that 
they could not ad- 
vance. They were, 
therefore, assigned 
to some other class 
in which they were 
permitted to do 
the thing they were 
able to do, but 
only after the loss 
of much time and 
energy to both 
pupil and teacher. 
With the aid of a 
general intelligence 
test these children 
can be located 

early in their school classes, thus reducin 
of waste. 

This tendency is clearly illustrated from Figures 13 and 14, 
which show the grades from which the children were selected to 
supply the special classes for subnormal children in a city of over 
100,000 after these classes had been in operation two years. 
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Fig. 14. — Gkahes 
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The error which has brought about the necessity for re- 
classification in the regular classes or in the special classes 
has resulted from an apparent assumption that children of a 
certain age group are of the same mental age. This principle 
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is seen in the practice of entering 6-year-old children into the 
first grade. At the end of one school year or at the age of 7 
years they are supposed to be ready for the second grade, and 
so forth. 

That all children of the same age are not of the same mental 
age is apparent from the following graph showing the mental 
ages of iSQ unselected children, all of chronological age 9. 

Fig. 15. — Distribution Accx)rding to Their Mental Ages of 159 Unselected 
Children (9 Years of Age), Binet-Simon Scales. 
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These 159 children make up the entire group of children 

9 years of age in a group of 743 children in 4 elementary 
schools who were tested with the Binet-Simon Test. Instead 
of all having a mental age of 9 years, only 48 showed exactly 
this age, 21 children had a mental age of a normal child at 

10 years of age, and 8 children a mental age of a normal 
child at II years of age, while 64 children showed a mental 
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age of a normal child 8 years, 17 children a mental age of 7 
years, and i child a mental age of 6 years of age. These 
figures show the same differences in abilities of children as 
are found in all unselected groups whenever a test that will 
distribute mental age is used. 

Although much has been written about the wide differences 
in the abilities of children and suflScient evidence given to 
support such conclusions, yet classroom practice too often 
lags behind because of the slowness with which usable tests 
have been developed and placed in the hands of the teachers. 
The teacher finds it exceedingly diflftcult to see and to plan 
for the needs of the class as a "whole. Likewise, the principal 
in his organization of classes is too much given to an organiza- 
tion on a basis of the needs of the group instead of the needs 
of the individuals in the group. Splendid attempts and good 
progress have been made in many places to recognize in a 
very definite way the individual difference among children, 
but in general, educational practice continues to handle them 
en masse on account of the lack of knowledge of group in- 
telUgence tests. 

A test in the hands of the teacher that will enable her to 
know the mental ages of the children in her class as she 
knows their chronological ages, will be a long stride toward 
classroom practice which handles children on a basis of their 
individual needs instead of group needs. Such tests are now 
available. The teacher can determine the mental ages 
of all the children in her class in the same time that was 
formerly required by an expert to determine the mental age 
of a single child. 

The following group intelligence tests are now available: 
Trabue Language Scales, not devised originally to measure 
general inteUigence, but found to do so with an accuracy that 
makes them very valuable; the Otis Group InteUigence Scale; 
Haggerty's Intelligence Examinations, Delta i and Delta 
2 ; and Whipple's Group Tests for the Grammar Grades. 
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Trabue Language Scales 

Aim. — It has been found that the Trabue Language Scales 
will measure the general ability of pupils. 

Description of Tests. — The Trabue Language Scales consist 
of scales B, C, D, E, J, K, L, and M. Scales B, C, D, and 
E are practically equal to one another. They are intended 
to be used in pairs — B with C, and D with E — by teachers 
in determining the abilities of children between the ages of 
7 and 20. Each scale is made up of 8 to 10 sentences with 
words omitted which are to be supplied. " The first sentence 
in each of these 4 scales is about i unit above an arbitrary 
zero point, the second sentence is approximately i unit more 
dijficult than the first, and so on until the last sentence in 
each scale is about 11 imits above zero." 

Scales J and K are intended to measure the ability of 
adults, and are, therefore, of practically no value for public 
school purposes. 

Scales L and M, which are not equivalent to scales J and 
K, or to any other pair, are intended to measure the abilities 
of high school students. " They have no very easy sentences, 
and the differences between the sentences are relatively small." 

The grouping into pairs of these scales of equal value 
provides a duplicate test for checking. 

Below is given a copy of Language Scale B to show the 
nature of these scales : 

Name 

Write only one word on each blank. Grade 

Time limit, seven minutes. Age (on last birthday) ...... 

TRABUE 
Language Scale B 

1. We like good boys girls. f 

2. The is barking at the cat. 

3. The stars and the will shine tonight. 

4. Time often more valuable moneY^ 
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5. The poor baby as if it were sick. 

6. She if she will. 

7. Brothers and sisters always to help other 

and should quarrel. 

8. weather usually a good eflFect one's spirits. 

9. It is very anno3dng to tooth-ache, often 

comes at the most time imaginable. 

10. To friends is always the it takes. 

Giving the Test. — The process of giving the Trabue Lan- 
guage Scales is very simple. A preliminary test is provided 
with simple sentences in order to make clear to the children 
exactly what they are expected to do. After this preliminary 
test has been given and explained fully, the children are ready 
for the regular test. 

Each child is provided with a scale on which he writes his 
name, grade, and age. On each sheet is the instruction, 
" Write only one word on each blank." Attention should be 
called to the fact that he will be given a time limit (7 minutes 
for scales B, C, D, and E ; 5 minutes for L and M) in which 
to do as much as he can. He will possibly not be able to 
fill all the blanks. As soon as the time limit has expired 
see that every child stops work and all papers are collected. 

Scoring Results. — For the convenience of the teacher and 
the accuracy of the results, the author has provided a detailed 
scheme for scoring the answers to the different sentences, 
which is as follows: 

General Scheme 

Score 2 

" A score of 2 points is to be given each sentence, completed 
perfectly. Errors in spelling, capitalization, and punctuation 
should not be allowed to affect the score. 

Score I 

" A score of i is to be given each sentence completed with only a 
slight imperfection. A poorly chosen word or a common gram- 
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matical error, which makes the sentence less than perfect and yet 
leaves it with reasonably good sense, should serve to reduce the 
score from 2 to i. 

Score o 

" A score of o is to be given if the sentence as completed has its 
sense or construction badly distorted. A sentence must have 
reasonably good meaning and express a sentiment which might 
honestly be held by an intelligent person in order to receive a higher 
credit than zero." 

The following is a sample of the answers provided in this scheme, 
for the first three sentences of Scale B : 

Language Scale B 

1. We like good boys girls. 

Score 2 

and, an 
Score I 

or, not, and good, also. 
Score o 

for, with, said the, and the. 

2. The is barking at the cat. 

Score 2 

dog, hound, pup. 
Score I 

dogs, boy. 
Score o 

man, cat, god. 

3. The stars and the will shine to-night. 

Score 2 

moon. 
Score I 

light, planets, lights. 
Score o 

dipper, stripes, clouds, city, sky, sun. 

etc. 

The score for each sentence should be recorded on the margin 
of the test. After all the test papers have been scored^ <fc& 
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scores are transcribed to a Class Record Sheet. Below is 
given a copy of the Class Record Sheet provided by the author 
with the scores from a 4-A class in a city school system. 

Table 49 
Class Record 
Date Jan.' 14 '30. Completion Tesl-Languftge Scales. Language Scale B. 
City D. State M. School F. Room No. g. Grade 4-A. Teacher H. S. 
Test GItsq b; H. S. number of Pupils tikjng test 35. Humber regolarlj 
enrolled pupils not taldng this test □. 
Test began at 11 : 30, closed at 11 : 3;. Time allowed 7 niin. Dnusual conditioni 
trhich might influence results of this test, none. Scores Assigned b; H. S. 
recorded by H. S. 
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This record is read as follows: S. L., age lo years, 2 
months, made a score of 2 on the second sentence and a score 
of zero on all the others which gives him a score of 2 ; W. B., 
age n years 3 months, made a score of 2 on sentences 2,3, 
and 6 and a score of zero on all the others maiiLing a total 
score of 6 ; etc. These scores are then distributed and the 
class median is determined, which for this class is 10.6. 
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Interpreting and Using Results. -^ After the class records 
have been made and the class scores determined, the next 
problem for the teacher is to interpret her results and apply 
them to her classroom practice. This can be explained best 
by reference to concrete situations. 

The first point to be determined is the relation of the class 
score to any class standards. The class standards for the Tra- 
bue Language Scale as reported by the author are as follows : 

Table 50 
Standard Language Scale Scores 



Scales 
B, C, D. E. F. 


Grade or Class 


Score 
(Median) 


Half 


Score 




II 


4.8 


IIB 
IIA 


3.8 
5.8 




ni 


8.0 


niB 

1 IIA 


7-4 
8.6 




IV 


lO.O 


IVB 
IVA 


9.6 
10.4 


^ 


V 


II.4 


VB 
VA 


II. I 
11.6 




VI 


12.4 


VIB 
VIA 


12. 1 
12.6 




VII 


13-4 


VI IB 
VIIA 


13.1 
13-6 




vm 

H. S.I 
H. S. II 
H. S. Ill 
H. S. IV 


14.4 

15.2 
16.0 
16.7 
17.4 


VTHB 
VlllA 


14.1 
14.6 




Tentative Standards in Scales 
J& K L & M 




7-5 
8.6 

9.4 

lO.O 


9.2 
10.5 
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The class score for the 4-A grade reported on the above 
class record sheet, Table 49, is 10.6. The standard score for 
this grade is 10.4. This class is, therefore, slightly above the 
standard. 

When the test has been given in other classes of the same 
city further comparisons can be made. Below are given the 
class scores from the 4-B grade in the same school and from 
the 4-A and 4-B grades in another school : 

Table 51 



School 


4-B 


4-A Grade 


I 


9.6 

10.2 

9.6 


10.6 


2 


II.8 


Standards 


10.4 



It is seen, therefore, that the class reported in Table 49, 
although above the standard, is below the 4-A grade of 
the other school, which attained a score of 11. 8. A fur- 
ther analysis shows that these four classes have made 
scores equal to or above the standards, and also that school 
number 2 scored considerably ahead of school number i. 

After the score for the entire class has been secured and 
interpreted the scores of individual pupils should be analyzed 
to ascertain whether or not all pupils are properly placed and 
are receiving proper instruction. For this purpose the class 
record sheet showing the score of each pupil should always 
be kept available for frequent reference by the teacher. 

By referring to the record of the class in Table 49, it will 
be seen that the lowest scores made by any boy or girl are 2 
and 8 respectively, and the highest 15 and 16 respectively. 
The lowest score made by any pupil is a score of 2 made by a 
boy, S. L., whose chronological age is 10 years, 2 months, and 
whose mental age as later determined by the Stanford Revi- 
sion of the Binet-Simon Test is 8 years, 2 months, or an Intelli- 
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gence Quotient of 80.3. The next lowest score is 6, which is 
also by a boy, W. B., whose chronological age is 11 years, 3 
months, and whose mental age on the Binet-Simon Test is 9 
years, 3 months, or an Intelligence Quotient of 82.2. The 
highest score made by any boy is 15, which was made by A. G., 
whose chronological age is 10 years, o months, and whose 
mental age is 10 years, o months, or an Intelligent Quotient of 
loo.o ; and the highest score of any pupil is 16, which was made 
by a girl, L. J., whose chronological age is 10 years, 8 months, 
and whose mental age is 12 years, i month, or an Intelligence 
Quotient of 113.2. It is evident, therefore, that some of 
these children should receive special attention. The pupils 
who made the low scores are not benefiting from the class 
instruction to the extent they should. They should be placed 
in a special class where more individual or perhaps a different 
kind of instruction can be given. 

The question should likewise be raised in connection with 
the examination of any class, as to whether the pupils who 
make the highest scores should be advanced to another grade 
or be assigned to a faster group. In the group above it would 
seem that the girl, L. J., at least should be given such con- 
sideration. 

With the aid of the Trabue Language Scales the teacher 
can quickly determine the general intelUgence of the class as 
a whole and also the general intelligence of each pupil in her 
class. If there is a question of doubt about certain pupils, 
the results from the Trabue Language test can be checked 
by a more refined measurement, such as the Stanford Revision 
of the Binet-Simon Test. With such knowledge about the 
mental ability of her pupils the teacher can classify her pupils 
so that her instruction can be more effective. 

Otis Group Intelligence Scale 

This scale has been devised in response to a wide demand for 
a test which will determine the general mental ability oi 

R 
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children in large groups. Since the ability to read is required 
to take this test, it is not applicable to persons with less than 
3 or 4 years of schooUng. Of this test Dr. Lewis M. Terman 
says : " With subjects of this much schooling, the Otis Scale 
probably comes as near testing raw ' brain ' power as any 
system of tests yet devised.'' 

The Aim. — The aim of this scale is to determine a pupil's 
general mental ability. It is expected that the Otis Scale 
will be used for school purposes, to classify, quickly and effi- 
ciently, large groups of children on a basis of their mental 
ages, in order to meet more adequately their individual 
needs, and that the Binet-Simon Test and others will be 
used to supplement this scale in cases which are in doubt, 
or which call for more reJ&ned measurements. 

Description of the Test. — The Otis Group Intelligence 
Scale is divided into two forms, A and B, which are different 
in substance but similar in structure. Each form is in a 
separate booklet. By this means the same group of children 
can be examined at different times without a knowledge of 
the tests affecting the results. The total point score for each 
is the same. Each form has ten tests, as follows : 

Number Time LiMir 

Test I Following directions 5 minutes 

Test 2 Opposites i^ minutes 

Test 3 Disarranged sentences ij minutes 

Test 4 Proverbs 6 minutes 

Test 5 Arithmetic 6 minutes 

Test 6 Geometric figures 6 minutes 

Test 7 Analogies 3 minutes 

Test 8 Similarities test 4 minutes 

Test 9 Narrative completion 6 minutes 

Test 10 Memory 3 minutes 

The scale can be used with children in grades 4 through the 
high school, and even with university students if desired. 
Giving the Test. — Any person who is able to teach can, 
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after a little study, apply these tests with a sufficient degree 
of accuracy to insure satisfactory results. Before attempting 
to give the tests, however, the teacher should practice on the 
instructions given in the Manual of Directions, which should 
always be available. Each child must have a copy of the 
scale in booklet form. The instructions for each test are 
written at the top of the test, but divided from the test by 
a heavy black line. Too much care cannot be exercised in 
seeing that the children follow specifically the instructions as 
outlined. 

Scoring Results. — An examiner's key on transparent 
paper is provided, which makes the scoring of the papers a 
very simple matter. For the scoring of all tests except test 
3, the check mark (V) opposite each correct answer can be 
used. The sum of the number of checks will be the score of 
the individual on the test. 

In scoring test 3, a check should be placed after each 
correct answer and a cross after each incorrect answer only. 
No attention need be paid to omitted answers. The score 
will be the number of correct answers minus the number of 
incorrect answers ; that is, " the number of checks minus 
the number of crosses." (For more detailed instructions 
for scoring, see pages 29 and 30 of the Manual of Directions, 
1 91 9 edition.) The sum of the scores on each individual test 
will give the individual's score. This score can be placed on 
the front page of each individual's test sheet, or it can be 
transferred to a record sheet on which the name of each child 
can be written and the score on each test, together with his 
total score, placed opposite his name. 

The score of each pupil can be expressed in terms of first, 
mental age; second, intelligence quotient; third, percentile 
rank ; fourth, coefficient of brightness. 

To date no age norms are available from which a child's 
mental age can be determined. The author, however, is 
collecting results from these tests wherever they are ©vecL^ 



244 B<^^ ^ Measure 

and undoubtedly will have such age norms in publication 
in the very near future. 

To secure an age norm for the group tested the exami- 
nation booklets are arranged according to the exact ages of 
the children. " To do this it will be necessary to take account 
of the date of the birthday. The Total Score Norm fot the 
age of 12 years may then be taken to be the average of the 
Total Scores of all pupils whose ages were between ii years, 
no months, and 13 years, no months. The Norm for the 
age of 12 years, i month, may be taken as the average of the 
Total Scores of all pupils whose ages were between 11 years, 

I month, and 13 years, i month, etc. The Mental Age of a 
pupil may then be seen at a glance by noting the age for which 
his Total Score is the Norm.'' 

The intelligence quotient of a pupil up to 16 years of age 
can be secured by dividing his mental age by his chronological 
age. Beyond 16 years of age, the mental age is divided by 
16, for the reason that an individual is practically mature 
at 16 years of age. For a further discussion on determining 
the scores, especially the percentile rank, and the coefficient 
of brightness, reference should be made to pages 32 to 36 of 
the Manual of Directions. 

Interpreting and Using Results. — In order to indicate to 
the teacher how she can determine the general intelligence of 
her pupils with the use of this test, the following results are 
given which were secured from two 4-B teachers and two 4-A 
teachers in a city school system. In all 104 children were 
tested. After the scores were obtained, the papers were 
classified according to the age groups, i.e. all the papers of 
children 8 years, o months, to 10 years, o months, were placed 
in one group ; all the papers of children 9 years, o months to 

II years, o months, were placed in another group, etc. The 
average of the scores on the papers in the first group gave 
the age norm of the 9-year-old children; the average of the 
scores on the papers in the second group gave the age norm 
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of the 10 year old children, etc. The normal chronological 
ages for the fourth grade are 9 and 10 years. The age norms 
for these two groups in these four classes are 54.4 and 
54.6 respectively. 

The small number of papers makes these figures only 
tentative norms. These norms would undoubtedly be changed 
by a larger mmiber of papers, which are necessary to estabUsh 
reliable standards. 

After the age norms for the different ages are secured the 
mental ages of the different individual pupils can be obtained 
by noting the age for which the pupil^s total score is the norm. 
For example, R. T., a 4-A pupil, 10 years, i month, made a 
score of 54. His mental age is, therefore, almost 10 years. 
Another 4— A pupil, S. H., 10 years, 4 months, made a score 
of 51. He is, therefore, less than 10 years old mentally. 

Until there is an age norm for children at every age including 
years and months, the exact mental ages cannot be determined. 
When that information is available to the teacher, she can for 
all practical purposes determine the mental ages of her children 
whereby a far better grouping or classification can be secured 
than on the basis of the chronological ages. 

Haggerty's Intelligence Examinations: 
Delta I and Delta 2 

Aim. — The purpose of this test is to measure the native 
ability of groups of pupils in the elementary school in order to 
group them properly or in a limited way to measure their 
progress. 

Description of Tests. — The tests appear in two pamphlets, 
the one. Intelligence Examination: Delta i, for grades one to 
three inclusive ; and the other. Intelligence Examination : Delta 
2, for grades three to niiie inclusive. Delta i contains the 
following exercises : 

Exercise i. Oral Directions 
Exercise 2. Oral Directions 
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Exercise 3. Cop3dng Designs 

Exercise 4. Copying Designs 

Exercise 5. Picture Completion 

Exercise 6. Picture Completion 

Exercise 7. Picture Comparison 

Exercise 8. Picture Comparison 

Exercise 9. Symbol Digit 

Exercise 10. Symbol Digit 

Exercise 11. Word Comparison 

Exercise 12. Word Comparison 

Exercises 2, 4, 6, 8, 10, and 12 determine the pupil's score; 
the others are preliminary exercises and are not counted in 
scoring. Simple instructions are given to the teachers for 
the different performances under each exercise. The diflS- 
culty which small children would encoimter in reading or in 
following complicated instructions is avoided. Delta 2 is an 
adaptation of the army intelligence tests. It has been used 
more widely than Delta i. In addition to the examination 
of 15,000 school children in the state of Virginia, it has been 
used extensively in many of the larger city school systems 
throughout the country. It consists of the following exercises : 

Exercise i. Sentence Reading 

Exercise 2. Arithmetical Problems 

Exercise 3. Picture Completion 

Exercise 4. Synonym-Antonym 

Exercise 5. Practical Judgment 

Exercise 6. Information 

The first 5 performances of Exercises i and 2, Delta 2, are 
given to show the nature of the tests : 

Exercise i 
Directions : 

1. Read this question : Do cats see? No Yes 

The right answer is Yes ; so a Une is drawn under Yes. 

2. Read the next question : Is coal white? No Yes 

The right answer is No ; so a line is drawn under No, 
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Below are a great many more questions. Read them carefully, 
one at a time, and draw a line under the right answer. When 
you are not sure, guess. 

1. Do dogs run ? Y^ No 

2. Can a doll sing? Yes No 

3. Does the sun shine ? Yes No 

4. Do men drink water ? Yes No 

5. Are all apples red? Yes No 

Exercise 2 

Get the answers to these problems as quickly as you can. Use 
the side of this page to figure on if you need to. 

Samples. — i. How many are 5 men and 10 men? Answer (15) 
2. If one pencil costs 5 cents, what will 

4 pencils cost? Answer (20) 

1. How many are 30 men and 7 men? Answer ( ) 

2. A boy had 10 cents and spent 4 cents. How 

many cents had he left ? Answer ( ) 

3. If you save $7 a month for 4 months, how much 

will you save ? Answer ( ) 

4. If 24 men are divided into groups of 8, how many 

groups will there be ? Answer ( ) 

5. A boy had 12 marbles. He bought 3 more, and 

then lost 6. How many marbles did he have 

left ? Answer ( ) 

Giving the Tests. — A carefully devised manual of directions 
is provided by the author which must be in the hands of the 
teacher and thoroughly understood by her before any attempt 
is made to give the tests. The instructions in the manual 
are simple so that no teacher should have any difficulty in 
applying the tests to her class. Each child in the class must 
be provided with a test. The entire class can be examined 
at once. 

Scoring the Results. — The Manual of Directions also 
provides explicit instructions for scoring the tests. A Scor- 
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ing Key is provided for both tests. The score of each pupil 
is the sum of the scores made on the several items of the 
test. The maximum score for each test is as follows : 



Delta I 



Delta 2 



Exercise 


Maximum Score 


Exercise 


Maximum Score 


2 

4 

6 

8 

10 

12 

Total . . . 


10 
10 
16 
20 
48 

25 
129 


I .... . 
2 

3 

4 

5 

6 

Total . . 


40 
20 
20 

40 
16 
40 

176 



After the score for each pupil is determined these scores are 
transferred to a class record sheet and the median class score 
is determined. The results are given in terms of median 
scores and age norms for each grade. 

Using the Results. — These Intelligence Examinations 
have been used with a sufficiently large group of pupils to 
insure standards that are exceedingly valuable for comparative 
purposes. Test Delta 2 has been u^ed with about 20,000 
children and Test Delta i with 4000 children. The following 
tentative standards are available : 

Table 52. — Grade Standards for General Intelligence Test: 

Delta i 




Grade at end of year 
Score 



Table 53. — Age Norms for General Intelligence Test : Delta i. 



Age . 
Score 




10 
75 
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Table 54. — Standard Scores in General Intelligence Examina- 
tion: Delta 2 for Each of Grades 3 to 9 Inclusive 



Grade 
Score 



3 


4 


5 


6 


7 


8 


40 


60 


78 


96 


no 


120 



9 
130 



Table 55. — Age Norms for General Intelligence Test : Delta 2 



Age . 
Score 



8 


9 


10 


II 


12 


13 


14 


15 


25 


43 


55 


66 


77 


87 


100 


115 



Table 56. — Standard Scores in Exercises i and 2 : Delta 2, for 

Grades 3 to 9 Inclusive 



Grade 
Exercise i 
Exercise 2 



3 


4 


5 


6 


• 

7 


8 


14 


20 


23 


27 


30 


32 


5-0 


7.0 


9.0 


10.5 


II. 5 


13-0 



9 
35 
15-0 



From the above standards the teacher can tell whether 
or not her class as a whole measures up to the standard for the 
grade. She can also determine the mental age of each m- 
dividual pupil in her class, whether or not they are above or 
below the standard for the grade. With the aid of such a 
test the teacher should have no difficulty in determining the 
native ability of each individual pupil in her class. 

The Group Tests for Grammar Grades, hyldr. G. M. Whipple,^ 
are group intelligence tests similar in many respects to the 
Otis and Haggerty tests. The purpose of these tests is to 
select the brighter pupils from grades 4, 5, and 6, but the tests 
quite fully distribute pupils in these grades according to 
intelligence. It takes 92 minutes to give the tests to a sixth 
grade, the entire room being tested at once. But when the 
papers are scored, the teacher has before her a fair measure 
of the intelligence of each pupil in the group. 



^ See Bibliography at close of chapter. 
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The Standard Revision of the Binet-Simon Test 

This test is possibly the most accurate instrument so far 
available with which to determine the native ability of 
American children. On account of the fact that from 30 to 
60 minutes are consumed in examining each pupil and that 
special training is required by the one applying the test, it 
cannot be used for the examination of large groups of children 
in a limited time. After the group tests suggested above have 
been given, there will always be questions which can be settled 
only by a more reJ&ned measurement such as the Stanford 
Revision of the Binet-Simon Test. It is not too much to 
expect that the group intelligence tests will open the way and 
extend the use of the latter test. 

An illustration of how this test will supplement the group 
tests is given in Table 57. The teachers in two 4— A classes 
in a city school system examined their children with the 
Trabue Language Scale. In all 44 children were tested. 
All of them were also examined by a psychologist with the 
Stanford Revision of the Binet-Simon Test. 

The standard for the 4— A grade on the Trabue Scale is 10.4. 
Nineteen of the 44 children made a score below this standard. 
On the Binet-Simon Test all but 10 of these 44 children made 
an Intelligence Quotient above 80. 

Table 57 gives the scores on the Trabue Language Scale and 
the Binet-Simon Test for those 18 children who scored below 
the standard for the Trabue Language Scale. 

From these figures it is seen that 9 of the 14 children 
scoring from 8.0 to 10.3 inclusive on the Trabue Scale tested 
between 80 to 90 Intelligence Quotient or below. In the 
group between 80 and 90 Intelligence Quotient are found, 
according to Terman, *' those children who would not accord- 
ing to any accepted social standards be considered feeble- 
minded, but who are nevertheless far enough below the 
actual average of intelligence among races of western 
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European descent that they cannot make ordinary school 
progress or master other intellectual difficulties to which 
average children are equal." Of the children scoring 
below 8 on the Trabue Language Scale all except one 
(Intelligence Quotient 81.8) tested below 80 Intelligence 
Quotient. 

Table 57 



Trabxte Language Scale 


Intelligence Quotient on Binet-Sdcon Test 


6.3 


75.8 


6.6 


81.8 


7.6 


75.5 


7.6 


75- 


8. 


89.9 


8.3 


89.9 


8.3 


81.7 


8.6 


lOI. 


8.6 


66.1 


9- 


93.1 


9.3 


77.3 


9-5 


71.5 


9.6 


73-3 


9.6 


90.9 


10.3 


84.8 


IO-3 


96.9 


10.3 


98.3 


10.3 


80.9 



In this particular city it is the practice to consider pupils 
eligible to a special class when they test below 80 Intelligence 
Quotient. It would seem, therefore, that by having the 
psychologist test with the Binet-Simon Test the children 
who scored below 10.3 on the Trabue Language Scale, those 
children could easily be detected who should receive special 
instruction. 

In the same way the children whp te^t very hi^ ^^'^ ^^Jo^ 
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Trabue Language Scale should be given an individual examina- 
tion for further classification. 

By combining the use of the Binet-Simon test with such 
group intelligence tests as the Trabue Language Scale, the Otis 
Intelligence Scale, the Haggerty Intelligence Examinations: 
Delta I and Delta 2, and the Whipple Group Tests for the 
Grammar Grades, classroom practice can readily be placed 
on a more scientific basis. The availability of such tests 
makes it possible for every teacher to know the mental age 
of every child. 

The training of a teacher in a normal school should include 
a course in the testing of general intelligence so that every 
teacher may apply any general intelHgence test with a reason- 
able degree of accuracy. Moreover, every teacher could 
well afford to spend six weeks in the sunoimer in an institu- 
tion where such training could be secured. 

On account of the detailed instructions necessary for the 
application of the Stanford Revision test, no person should 
attempt to give it without having access to the " Measure- 
ment of IntelHgence '' by Terman, or some book like it in 
which such instructions are given. 
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CHAPTER XI 

STATISTICAL TERMS AND METHODS 

The purpose of this chapter is to give only so much informa- 
tion from the science of statistics as the teacher needs to know 
in order to administer a test, tabulate the scores, and interpret 
the results. This will necessitate, also, the explanation of 
statistical terms sufficiently to enable one to understand such 
terms when used in the discussion of the measurement of any 
school subject. 

Securing Comparable Results. — One decided advantage of 
a standard test is the possibiUty of comparing the results with 
similar results in other rooms, other school systems, or with 
tentative or fixed standards. Manifestly, such comparisons 
can be made to advantage only when the tests have been 
given under similar conditions. The following suggestions 
may be considered as rules of the game for securing com- 
parable results: 

1. In giving a test it is essential that the conditions of 
the test be kept constant. 

2. The directions which accompany a test should be followed 
in every detail. If possible, use a stop watch to secure exact 
time, when there is a time limit. 

3. It is an advantage if the examiner has a clear conception 
of the nature of the test, its purpose, and the use to be made 
of it. 

4. At the time of giving the test all needed secondary data 
should be secured, such as name, age, grade, school, date, etc. 

5. Most tests as a part of their instructions provide for 
a preliminary trial in order to make pupils familiar with the 
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test. In case such provision is not made in the instructions, 
the teacher should devise a preliminary test which should be 
similar but somewhat easier than the one to be given, in 
order that the pupils may thoroughly understand what is to 
be done and how to do it. 

6. The test should be handled as nearly as possible just 
as any other regular lesson. An appeal to extra effort is 
allowable, but other comiments likely to secure results that 
are not normal should be avoided. Appeals which are made 
to the child's desire to do well in the test should be included 
as a part of the regular instructions, in order that conditions 
of the test may be uniform for comparisons. 

7. For many purposes a single test is sufficient. In case the 
decision to be based upon a test is of unusual importance, 
at least two specimens should be taken, or two tests given, or 
the judging be done by at least two competent judges. In case 
there is a decided discrepancy between the two results, the 
teacher will realize that further testing should be done. As 
will be apparent from further study of statistical methods, a 
score for a class is much more reliable than for an individual, 
and the score for an entire city more reliable than that for a 
single class. This is due to the fact that slight errors tend to 
balance each other in such a way as to give a more accurate 
judgment on a large group than on a small group or a single 
individual. 

8. Care should be taken not to use the material of stand- 
ardized tests for practice purposes. 

9. In case the test is given frequently the results will be 
much more representative if an alternative test of equal value 
has been provided by the person who devised the test. 

Using a Standardized Test. — Teachers to-day can scarcely 
attend an educational meeting or read an educational magazine 
without hearing about scales and standardized tests, and their 
advantages in measuring the work of the schools. For the 
teacher thus interested, but who has not had a norcasal <^<:fcRRk 
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or college course in educational measurement, the following 
directions are given with the assurance that an intelligent 
teacher may go forward in such work even though she may not 
have the help and guidance of a trained supervisor. 

I. Selecting the Test. — In selecting the test to use, the 
teacher may well be guided by the particular purpose which 
she has in mind. The preceding chapters deaUng with the 
available tests in different subjects will permit the teacher to 
make a choice on the basis of the best test for the particular 
purpose. In general, those tests should be chosen which have 
been most widely used, and which require the least time for 
giving and marking the papers. Yet this is not an infallible 
rule. The Woody tests are certainly much more valuable 
than the Courtis tests in arithmetic. Yet the Woody tests 
have been given very little in comparison with the wide use 
of the Courtis tests. They are a littie more difficult to 
administer and to score. Yet, from the standpoint of value 
in diagnosing the pupils' difficulties, they are far superior to 
the CcJurtis tests. The tests that are going to survive and 
show value in the next few years cannot be determined at 
this time. The final judgment upon the test must be passed 
by the teacher in the schoolroom on the basis of its value in 
helping her in her work of discovering the needs of the children 
and applying the appropriate remedies. It may be properly 
assiuned that, although a test is more difficult and requires a 
longer time, if it is superior in every respect, the teacher will 
find the time for giving it. It requires considerable time to 
give Gray's Oral Reading Test, yet the results of giving the 
test are so valuable that the teacher does not hesitate to take 
the necessary time for giving it. 

When a test may be chosen on the basis of difficulty, as in 
the use of the Ayres Spelling Scale, the teacher should keep 
in mind that a good test should be so difficult that no pupil 
will make a perfect score, and sufficiently easy that most pupils 
in the grade will secure a score which is reasonably satisfactory. 
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2. Giving the Test. — In giving a test the teacher should 
follow carefully the printed directions which accompany the 
test. This is the chief rule to keep in mind. Other details 
are mentioned above under " Securing Comparable Results." 
The teacher who has time and is willing to experiment may 
easily demonstrate the possibility of changing a score by a 
slight change in directions or by a diJBferent attitude in present- 
ing the work to children. The chief consideration, if com- 
parisons are to be made, is that the attitude, detailed directions, 
and every element entering into the giving of the test shall be 
as indicated in the directions, so that pupils in one city, or 
even in one state, may be compared with those in another. 
In handwriting, for example, pupils should be so instructed 
and handled that they will write at their natural rate, thus 
securing results in the test that will represent the normal 
situation. 

3. Scoring the Papers. — Every test provides printed 
directions for scoring the papers in order to aid teachers in 
securing uniformity of results. These printed directions 
should be followed impUcitly. If the teacher has opinions 
as to what should be done, and these opinions are different 
from the directions, such opinions should be abandoned if the 
results of the test are to be used for comparative purposes. 

The teacher is urged to have the pupils aid in scoring the 
papers in so far as it is possible. This can be done very 
largely in arithmetic, in spelling, in certain reading tests, 
and, to an extent, in writing. The chief purpose of involving 
the child in the grading is to further increase his interest. 
This is an incentive and a motive which is worth while for 
teaching purposes, and which will lead the child to greater 
effort in order that he may score higher in a future test. 

4. Tabulating Results. — Directions for tabulating results 
or distributing the scores are provided in connection with 
most of the tests. A common method of making a dis- 
tribution is to arrange the papers in ordex. Tw^ V^'^y?^^:^ ^^^^^Jss. 
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then draw off the scores, noting the number of papers falling 
at each point. This gives the distribution. For further use, 
the teacher will need to supply the names of pupils opposite 
each score, or, in case she is noting mistakes, opposite each 
mistake. The results of any test cannot be intelligently used 
until they have been arranged in some systematic order, 
particularly if the number of pupils involved is large. 

5. Statistical Calculations and Graphic Representations, — 
The statistical points to be determined after the results of a 
test have been tabulated are usually the median, the quartiles, 
and, sometimes, the average or standard deviation. These 
points will be explained in the next section of this chapter. 
When understood, they are very valuable, particularly in 
making a comparison of one room or one school system with 
another. 

To represent the scores graphically often helps the teacher to 
see points which would otherwise remain hidden. A graphic 
representation is made by noting the number of scores falling 
at each point of the scale, and representing the number by 
the distance from the base line, and then drawing a line 
connecting all of these points. The height of the line above 
the base line enables the teacher to see at a glance just what 
is happening in her class. 

6. Interpretation of Results, — The teacher is warned to 
avoid conclusions until she has mastered the technique and 
the significance of the test and has given it to different groups, 
or enough times to the same group, to clear up in her own mind 
the various questions that may arise in giving the test. Especial 
care should be taken not to draw far-reaching, general con- 
clusions from a test. A test is usually devised for a specific 
purpose. The significance of the test in other fields can be 
known only through the figuring of coefficients of correlation 
after a large number of cases has accumulated. A good 
drawing of the moon by an eighth grade pupil means a good 
drawing oi the moon — not an artist, not an astronomer. 
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Nothing should be taken for granted. Mistakes will be 
avoided by caution, and fear will be eliminated by a thorough 
understanding. 

7. Applying Remedies. — The ultimate purpose of a test, 
so far as the individual teacher is concerned, is to enable her 
to see the needs of her pupils and to search out the appropriate 
remedies. The discovery of the remedy in any subject 
takes her into the question of methods of teaching, but this 
is a desirable result. To use a test for measurement only, 
without carrying the work forward to a point of use and 
apphcation in better teaching, is to close the eyes to the 
significance of a situation after it has been revealed. The 
teacher, after giving a test, is in the position of a speciaHst who 
has diagnosed a bodily ailment. The diagnosis means nothing 
unless the appropriate remedy is applied. The recognition of 
this fact leads a teacher or a group of teachers, again and 
again, into the study of methods of teaching with reference 
to the subject tested. 

8. Cooperation, — In a city system, the closest possible 
cooperation is urged between supervisor and teachers, not 
only for the benefit of the teachers, but as well for the benefit 
of the supervisor. Cooperation, understanding, and mutual 
confidence are always valuable assets, and especially so in the 
use of the tests which may reveal teacher weaknesses as 
well as pupil weaknesses. The teacher, however, will be the 
first to want to correct any revealed defects, and her interest 
and cooperation will enable the superintendent or supervisor 
to secure other important results, such as : 

a. A more scientific attitude toward school work. 

b. A closer, checking of results and a realization that pupil 
errors are specific and need individual attention. 

c. Better time allotments, more definite assignments, a 
clearer conception of the objectives to be attained, and more 
efficient methods of teaching. 

Statistical Terms. — The purpose of the statistical tre^.^-- 
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ment of scores is intelligent interpretation. The first step in 
the handling of scores is to give them systematic arrangement. 

A distribution is a systematic arrangement of scores. 

A table of freqiiency is a table showing the scale and the 
distribution of scores at each point on the scale. 

The following are the unarranged grades of seventy-seven 
sixth grade pupils in arithmetic: 74, 92, 65, 69, 76, 80, 62, 
73, 85, 81, 79, 66, 59, 75, 76, 81, 84, 74, SS, 73, 86, 75, 71, 60, 
92, 8s, 76, 82, so, 65, 92, 100, 81, 7S, 8s, 97, 6s, 91, 8s, 86, 
72, 55, 75, 75, 72, 77, 62, 95, 87, 75, 75, 70, 76, 87, 8s, 82, 67, 
90, 81, 95, 80, 86, 80, 75, 67, 70, 72, 84, 76, 70, 88, 72, 80, 75, 
67, 82, 72. 

Thus arranged, the scores have little significance. They 
need statistical interpretation. The following table of fre- 
quency shows a scale with intervals of 5, and on the right hand 
side the number of scores at each point on the scale. This 
right-hand column represents the distribution, A grade is 
recorded at the nearest " 5 '' point on the scale, thus, 74 is 
recorded or scored at 75, 92 at 90, etc. 

Table 58. — Frequency Table : Showing the Arithmetic Scores 

OF 77 Sixth Grade Pupils 



Grades, Intervals op s (Scale) 


Number of Scores at Each Point on Scale 
(Frequency) 


50 


I 


55 


2 


60 


4 


65 


7 


70 


10 


75 


19 


80 


12 


85 


12 


90 


6 


95 


3 


TOO 


I 


\ 


Total 77 
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This table is much more useful than the undistributed 
grades, as it enables the teacher to see the number of pupils 
(or nmnber of scores) at each point on the scale. 

Special signijicance is usually attached to certain points 
on the scale, such, for instance, as the passing mark. If 70 is 
the passing grade, the teacher sees at once that 14 of the pupils 
have failed. 

Other points on the scale that have statistical value are the 
median, the quartiles, the mode, the average, and the range. 

The median is the middle score, or the point on the scale 
above and below which an equal nimiber of scores fall after 
the scores have been arranged into a table of frequency. In 
Table 58 there are 77 scores, so that the middle one would be 
the 39th score from either end of the distribution. The 39th 
score falls at 75, and therefore 75 is the median score. In case 
of an even number of scores, the median is the average of the 
two middle scores. 

The quartiles are the points on the scale arrived at by taking 
\ and I" the scores, counting in from either end. It is usual 
to start at the bottom of the scale, so that counting up until 
\ the scores have been covered locates the point on the scale 
known as the^r^^ quartile, and the distance up the scale neces- 
sary to include -f of the scores locates the third quartile. The 
second quartile is seldom referred to as it is the same as the 
median. It is evident that the first and third quartiles are 
the points midway between the median and the extremes. 
The middle 50% is a term frequently used. It represents the 
number of scores falUng between the first and third quartiles. 

The extremes are the outside limits of the distribution, 
and the distance between the extremes indicates the range of 
the distribution. 

The mode is the point on the scale where the greatest number 
of scores fall. In Table 58, the mode is 75. 

The average is found by adding the scores together, and 
dividing the sum by the number of scores. 
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Deviation. — Some method of indicating by a single figure 
the deviation of the scores from some central point like the 
median is frequently used. Average deviation is most often 
used and it is found by taking the average of the deviations 
of the individual scores from some central tendency, usually 
the median. Standard deviation is also used to express devia- 
tion. It equals the square root of the sum of the squares 
of the deviations from the arithmetical average (although 
the median may be used instead of the average). 

The teacher will have Uttle use for figuring deviation in 
the present volmne. 

Correlation, — The relation between two paired series may 
be expressed by a single figure known as the coefficient of 
correlation. The figure ranges from — i to +1, the latter 
figure representing perfect correlation or agreement. The 
work of the present volume will not require the derivation of 
this term, so the reader is referred to works Usted in the bib- 
Uography in case of a desire to know the method of figuring 
the coefficient of correlation. 
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CHAPTER XII 

THE teachers' USE OF SCALES AND STANDARDIZED TESTS 

The college instructor blames the high school teacher, the 
high school teacher complains of the grade teacher, each 
grade teacher above the first grade finds fault with the poor 
work of the teacher in the grade below, and the first grade 
teacher in turn is chagrined at the shortcomings of the 
home training. Must this go on indefinitely ? Whose opinion 
should prevail? Is it not possible to get away from personal 
opinion to an agreed-upon consensus of opinion? May we 
not replace the constantly conflicting subjective standards 
with definitely defined objective standards? 

Present Grading System. — If 20 mechanics were sent out 
into a mill yard to cut and bring back a steel rod just long 
enough to reach from one girder to another, but were not 
given the measured distance between the girders before going, 
nor permitted to take a ruler or tape to use in selecting the 
rods, no experiment is needed to prove that each one of the 
20 rods would be different in length and no one of them would 
exactly span the distance from girder to girder except by 
chance. On the other hand if the foreman were to use a 
steel tape in measuring the width between the girders, and 
were to permit the mechanics to measure the length of the 
rods before cutting them, they would return with 20 rods 
each meeting with his approval. 

Is it possible for the school foreman, the teacher, to replace 
her subjective standard, her mere opinion, by an objective 
standard approximating the steel tape of the shop? The 
need of more accurate, objective standards in grading is 
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generally appreciated. The following are some of the 
evidences of such need : 

(i) There are constant complaints from teachers in upper 
grades, as indicated above, against the poor quality of work done 
in the lower grades. 

(2) There is wide variation in the distribution of grades 
among the various departments of the same school. In one 
high school, for example, 80% of the EngHsh grades were 90 
or above, while only 4% of the mathematics grades were 90 
or above. In the same high school, the German teacher gave 
70% of her pupils 90 or above, while the Latin teacher gave 
only 2^% of her pupils a grade of 90 or above. 

A recent study of college grading well illustrates this point. 
The study covered a total of 12,782 grades by 10 professors 
covering a period of 5 years. The grades given by professors 
number i, 3, and 4 are shown herewith: 



Propessor 


Grades 
(Total) 


FAn-ED 


76-79 


80-84 


86-89 


90-84 


96-100 


No. I . . . 
No. 3 . . . 
No. 4 . . . 


107 1 
1422 
2196 


32.1% 
9.8 

3-3 


12.7% 
7.0 

6.2 


15.9% 
II.9 

193 


14.9% 

159 
36.2 


12.7% 
14.4 

28.5 


11.5% 
40.7 

6.3 



The contrast between Professor No. i and No. 3, who 
represent the extremes, is brought out more strongly by the 
graphic representation (see Fig. 16) than by the table. Pro- 
fessor No. I fails approximately one third of his students and 
then distributes the others about equally among the 5 remain- 
ing points of the scale. Quite the opposite, Professor No. 3 
gives two fifths of his students an honor grade, and then 
distributes the other grades about equally among the 5 other 
points of the scale. These figures are in the main true for 
each of the 5 years studied, without regard to the maturity of 
the students, whether they be freshmen, sophomores, jimiors, 
or seniors. 
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Fig, 16, — Showing graphically the distribution of grades given by three 
college professors at Iowa State College. 

A study of the distribution of the grades given by tbe faculty 
of any large high school or college is likely to show similar 
results, unless the problem of grading has received special 
attention. 

(3) There b a wide variation in the distribution of grades 
among teachers of the same department. Of 2 instructors 
in the same department i gave to 43% of his students the 
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grade of " excellent '' and to none the grade of " failure," 
whereas the other gave to none of his students the grade of 
" excellent " and to 14% the grade of " failure." ^ There 
must have been a few good and a few bad in each group. 

(4) The fact that pupils transferring from one school 
system to another are frequently demoted indicates that minor 
details rather than large fundamental considerations are the 
determining factors in classifying them. Since pupils are 
constantly shifting, in many schools as high as 20% being 
new to the system each year ,2 this is a very important item. 
In fairness to the child, as well as the school from which he 
came, it should be possible to determine his standing through 
the use of objective standards, and so place him in tJie proper 
grade.3 

Differences in Grading Same Paper. — A study by Dr. 
Daniel Starch illustrates very clearly the variation among 
teachers of a single subject in grading that subject. A paper 
in English was submitted to 142 teachers of English. The 
grades varied from 50 to 97, the passing grade being 75. 
Twenty-six of these teachers, or 18%, marked the paper a 
failure, that is, graded it below 75. On the other hand, 14 

^ Starch, Daniel : " Educational Measurement," p. 3, quoted from Dearborn. 

^ Typical facts with reference to the proportion of school children who leave 
school, because of leaving the city, are easily gathered from current school 
reports. The following are illustrative : 

In Waterbury, Connecticut, 1914-15, there was a total enrollment of 13,954. 
Of this number 902, or 5.4 per cent, left school during the year. Of those who 
left school, 426, or 47.2 per cent, left the city. Similar facts for other cities 
follow : 

In Des Moines, Iowa, 19 13-14, 10.7 per cent left school. The proportion 
of those who left the city was 61.3 per cent. 

In Decatur, Illinois, 1913-14, 15.7 per cent left school. Of these 62.7 per 
cent left the city. 

In Connersville, Indiana, 1910-12, two years combined, 14.8 per cent left 
school. Of these 57.8 per cent left the city. 

Pupils who leave the city will usually enter other school systems. 

' Asst. Supt. O'Hem in the May, 19 18, number of Elementary School Journal 
calls attention to the value of standard tests for placing new pupils in the 
right grades. 
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of the group marked it 90 or above, indicating that in their 
opinion it was a very superior paper. 

In mathematics, a similar test gave results that were even 
more surprising, particularly so in view of the fact that mathe- 
matics is considered one of the exact sciences. A geometry 
paper which was submitted to 118 teachers received grades 
ranging from 29 to 92, the passing mark being 75. Sixty-eight 
of the teachers, or nearly 58% of them, marked the paper a 
failure. Fifty of the group marked it 75 or above, one giving 
it a grade of 92. 

A history paper graded by 70 teachers showed similar 
variations, the grades ranging from 43 to 90. 

This but illustrates the present chaos resulting from the 
lack of standards in grading an ordinary examination paper. 
When this is multiplied by the variation in sets of exam- 
ination questions, it is apparent that on the present basis of 
examinations it is absolutely impossible to compare one sys- 
tem with another, one grade with another, or to compare from 
month to month the same grade with itself. It is unnecessary 
to discuss fully the above points. Others might be added, all 
indicating the need of objective standards. 

Uniform Examination Not Satisfactory. — One may ask if 
the purposes of an objective standard for measuring school 
achievement cannot be accomplished by a uniform course of 
study, uniform examination questions, and uniform grading. 
These items may properly receive attention in order. In the 
first place a uniform course of study is undesirable. It must 
be adjusted to community demands and pupil interests. It 
should differ greatly for children from the exclusive residence 
districts of New York City, and the children from Iowa farm 
homes. To attempt to secure a rigid uniformity in the course 
of study would be deadening in the extreme. The course of 
study should be flexible and provide for local variations. 
To possess knowledge which is useful and usable is much more 
fundamental in a democracy than to strive for a large common 
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intellectual possession composed too largely of material which 
is stale and useless. 

In the second place, all will agree that there is nothing 
more baneful and stupefying in its influence than a rigid 
examination system. It makes subject matter the aim and 
end. It leads to cramming. It militates against use and 
apphcation. It directs pupils to words in books instead of to 
hfe's real problems and their solutions. And strange as it 
may seem, a standard test will accomplish the desirable results 
of a uniform examination system without the undesirable 
results appearing. In subjects for which standard tests are 
not available, examinations must continue to be used. They 
have a value if rightly used. 

In the third place, all will admit that uniformity of grading 
is desirable. It is difficult, however, with an ordinary 
examination, although common practice may be improved by 
adopting a 5-point system and distributing grades according 
to the normal curve of distribution. How to improve the 
grading of a group of teachers along these lines has been well 
explained by Gray, Meyer, Dearborn, Judd, Starch, KeUy/ 
and others. But one of the greatest advantages of the 
standard test or scale is that it greatly aids in securing uni- 
formity of results in grading. In order to standardize a test, 
specific directions have of necessity been prepared for giving 
the test and for grading the returns. All of this means greater 
uniformity in grading, and greater fairness to individual 
classes or pupils in case of comparison. In fact, a standard 
test is, in a way, a well-selected uniform examination, accom- 
panied by specific directions which greatly aid in securing 
uniformity and fairness. 

Standard Tests. — But a standard test is much more than 

^ Kelly, F. J., "Teachers* Marks and their Distribution." Contributions 
to Education No. 66, Teachers College, Columbia University. This volume 
contains a good bibliography on the study of school and college marks and 
grading. 
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a uniform examination. The standardization of a single test 
or scale often requires a year or more of intensive work by 
one of our ablest educators. Not only must the subject 
matter be carefully selected and adapted to pupil ability, but 
it must be tried out with thousands of pupils, revised, and 
again tried out, until every detail of the test, its administration, 
its evaluation, and the grade or age standards, has been 
determined. Such an undertaking is too much to expect 
from the overworked teacher. But the teacher may prop- 
erly be expected to profit by the standard tests of subject 
matter which have become available. 

The difference between an examination and a standard test, 
as well as the progress of measurement in education, is fairly 
well illustrated in the attempt to measure arithmetic in the 
two Cleveland surveys, the first by a local commission in 
1906, the second by a survey committee composed of educa- 
tional experts selected from all parts of the coimtry only nine 
years later, 1915. 

The arithmetic test given in the first Cleveland survey 
was devised by men of maturity and judgment, but had not 
been standardized. It was not even based upon a wise selec- 
tion of subject-matter, and it could not lead to any vaUd con- 
clusions. It was used in at least one later survey.^ It did 
not justify further use, although it was doubtless as good as 
any test that could have been quickly devised under the 
circumstances. At the time Thorndike's writing scale had 
just appeared but had not come into general use, and there 
were no standard tests. 

In 191 5, however, the work of the Cleveland schools was 
measured in a scientific manner which carried conviction every- 
where. In writing, spelling, arithmetic, and reading, scales or 
standard tests were applied which clearly revealed the grade 
to grade progress of the pupils, made possible comparison of 
one building with another, and permitted comparison of the 

^ East Orange, N. J., 191 1, by Dr. E. C. Moore. 
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work in Cleveland with similar work in other cities through- 
out the country. 

While a particular teacher need not be greatly concerned 
about having a test that will permit comparison of the work 
in one city with the work in another, or even a comparison 
of her work with the work of other teachers in the same grade 
throughout the system in which she works, yet she should be 
concerned about the progress of the children within her own 
room. She should know the results of her work. She 
should have a device for the definite measurement of progress, 
due to a particular method, or a given time devoted to the 
work. These aims cannot be accomplished through the 
ordinary examination. They can be accomplished only 
through the use of scales and standardized tests. 

Initiating the Use of Standard Tests. — Whether the 
initiative in the use of standard tests be taken by the teacher, 
the superintendent, or a survey commission, the final result 
should be to help the teacher, and through her, the pupil. 

Miss Laura Zirbes,^ of the Cleveland University School, 
took the initiative in the use of standard tests, completely 
transformed her own theory and practice, and brought new 
life and more rapid progress to her pupils. In Boston, the 
initiative came from the central office, but in such sympathetic 
and cooperative form that teachers were effectively reached. 
Of more pronounced effect probably than any of these factors, 
however, was the stimulation among the Boston teachers of 
an inquiring attitude towards the whole problem of arithmetic 
instruction. " The results from the tests have shown the 
need of improvement; they have shown that the problem 
of arithmetic teaching is not yet solved, and they have 
prompted many teachers to study their own work as the first 
step towards improving methods of instruction." ^ Later 

^ "Diagnostic Measurement as a Basis of Procedure," Elementary School 
Journal J March, 191 8, pages 505-522. 
* Boston, Educational Bulletin No. X. 
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an entire bulletin^ was devoted to showing teachers and 
principals how to use the results of standard tests in reaching 
individual pupils and improving instruction. 

The teacher who uses a standard test in her own room for 
the purpose of knowing her pupils or locating the weak places 
in her instruction may take pride in the fact that she is putting 
herself in line with a vast army of scientific workers in educa- 
tion. She determines the median abiUty of 30 pupils in a 
single grade, the distribution of ability, the points of weak- 
ness, and the remedies to apply. A principal does the same 
for the entire building; the superiiitendent for the entire 
school system ; a state bureau for the state ; and a research 
specialist, by combining city and state results, gets norms of 
performance for a nation. The teacher thus sees herself as a 
contributor in a great piece of constructive work in scientific 
education, and she may, if she wishes, locate her particular 
group of children with reference to the thousands of other 
children throughout the county, — she may feel the thrill of 
being one of the 700,000 lieutenants who marshal the army 
of 22,000,000 American school children, in the interests of a 
safer and saner democracy. 

Uses of a Standard Test. — However, the most helpful 
point for the present purpose is that standard tests should 
be used by the individual teacher for the purpose of finding 
the weaknesses in her own work, evaluating methods, and 
definitely measuring the progress of her own pupils. It will 
be worth while to enumerate in order the uses that a teacher 
may make of a standard test. Some of these are in common 
with the uses which may be made of the results of standard 
tests by principals and superintendents, but many of them 
apply directly to the particular schoolroom and are in addi- 
tion to other uses. Standardized tests may be used : 

I. To determine conclusively whether or not a pupil is 
making progress. 

^ Boston, Educational Bulletin No. XIII. 
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2. To determine how much progress a pupU has made in 
a given time. 

3. To determine whether a pupil should be promoted, 
retained, or reclassified, in so far as the mastery of subject 
matter is made a condition of progress. Dr. Starch states 
that promotion on the basis of measured ability would save 
one year to one third of the pupils in the public schools.^ 

4. To determine even more accurately whether or not the 
class is making progress and the amount of such progress. 

5. To determine whether or not a class is up to standard 
when received from another teacher. This use of the stand- 
ard test would remove the constant complaint of teachers 
that the work has not been covered in the preceding grades. 

6. To justify a year's work with a class on the basis of 
actually measured progress. This will make it possible to 
show to a prejudiced principal or superintendent (if such 
there be) that reasonable progress has been made by a class. 

7. To show results in a manner that completely discounts 
the advantages of another teacher more attractive and popu- 
lar, in case such teacher depends upon winning promotion by 
methods not contributing to pupil progress. 

8. To detect the fact, in case more time cannot profitably 
be spent with retarded pupils. See, for example, the con- 
clusion of Superintendent Bliss of Montclair, N. J., that a 
group of subnormal pupils could not profit by further work in 
arithmetic.^ 

9. To release bright pupils from further work after deter- 
mined standards have been reached, as long as said standards 
are maintained. The teacher would thus limit the work 
required along mechanical and routine lines. Rice's articles ^ 
on the " Spelling Grind " over 20 years ago emphasized 
the fact of wasted youth through the schools. Over emphasis 

* Fifteenth Yearbook of the National Society for the Study of Education, 
Part I, p. 146. 

* Ibtd.f p. 75. ^ The Forunty XXIII : 163-172, 409-419. 
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upon the mechanical phases of school work closes the door 
to story, romance, history, literature, music, and play. 

10. To test one method against another by the amount 
of measured progress made by the pupils, e.g. textbook 
procedure versus large motivated problems, as a basis for 
developing ability in solving reasoning problems (in so far as 
devised tests adequately measure this educational product). 
It is apparent that such use of standardized tests would re- 
place the trial and error method as a means of determining 
correct procedure, and would replace it by a method much 
more scientific. 

11. To test one class plan, study plan,^ or administrative 
device against another, by measured results with the pupils. 

12. To determine the proper apportionment of school 
time to various subjects of study and other school activities. 
This use of standard tests has been well pointed out by Dr. 
Haggerty.^ 

Standard Test Saves Time. — Naturally the teacher asks, 
" But will not this scientific testing require a much larger 
time expenditure than I can give to it? I'm crowded for 
time as it is." 

This question can be answered only on the basis of the 
experience of other teachers. That experience shows that 
after the technique is once mastered, the time required for 
standard testing is not more, but frequently less, than the 
time consumed in marking papers under the old examination 
system. After the writing scale has been used for a while, 
has been conveniently posted for reference by pupils, and has 
been explained to them, the teacher will find that a committee 
of pupils can be relied upon to grade the writing of the room, 
honestly and quite accurately. In fact each pupil will grade 

* See p. 113, Schoolman's Week Proceedings (University of Pennsyl- 
vania), April, 1918, for comparison of class study and independent study in 
spelling. Reported by J. N. Adee, Superintendent Schools, Johnstown, Pa. 

* School and Society, IV : 761-771. 

T 
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his own writing by comparison with the scale. After the 
spelling test has been given, pupils may be allowed to ex- 
change papers and correct them while the teacher gives the 
correct spelling of the words. Likewise in arithmetic, the 
pupils can help the teacher in quickly grading the papers. 
This help by pupils in the shnpler tests should be encouraged 
not alone because it saves the time of the teacher, but chiefly 
because it creates a desirable interest and stimulates the 
pupils to put forth a greater effort to reach a given standard. 
Standard Test a More Effective Tool. — The question 
with regard to the time required for giving standard tests is a 
legitimate one, and an effort has been made to answer it. 
Every conscientious teacher will agree, however, that time 
is not the chief consideration. She puts in a full quota of 
time each day, and will continue to do so. If she is as wise 
as conscientious, she will also provide time for sufficient 
sleep and recreation each day. The chief consideration is 
that the teacher in mastering the details of the use and inter- 
pretation of a standard test is equipping herself with a more 
effective tool of service. Why should the teacher guess and 
estimate when she can measure? The unsatisfactory nature 
of the present grading system has been dwelt upon. A grade 
of 85 in one room cannot be compared with a grade of 85 in 
another room. The present unscientific method of grading 
must be replaced by scientific procedure if we are to continue 
to make educational progress. Improvement is certainly 
hampered by the use of a system which does not even permit 
of comparison, and thus give a definite measure of progress. 
Under the old system when two schools determined to compare 
the spelling ability of their pupils, all that they could do was 
to get the pupils together and have them compete in a spelling 
match. And yet as we look back upon the spelling match 
we see that the result was finally determined by the one best 
speller. The general merit of spelling in one school as com- 
pared with the general merit in the other was not determined. 
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• 

Measuring a Htiman Product. — The teacher may insist 
that she is dealing with a delicate hmnan product. This is 
true ; and yet, as Thorndike has pointed out, mental products 
can be measured and are being measured. " Whatever exists, 
exists in some amount." The work of the physician probably 
compares as closely as any other with that of the teacher. 
We want a physician who is kind and sympathetic, but we 
are not willing that these qualities be substituted for accurate 
and adequate knowledge. Regardless of his kindness and 
sympathy, he counts the pulse, and takes the temperature. 
In case an anaesthetic is to be administered, he calls in an 
expert to determine the amount and to administer it accord- 
ing to standard methods. In case of a surgical operation he 
again calls for an expert, frequently a busy, unsympathetic 
stranger. In all of this work, regardless of his kindness, 
sympathy, geniality, and his spiritual qualities in general, 
he relies upon accurate knowledge, definite measurement, and 
tested skill. He proceeds scientifically. The teacher should 
do likewise. 

" It is a popular superstition that human action, person- 
ality, and behavior will be penned up and hindered when 
measured by logical categories and fixed units. But, just as 
the pound weight has not interfered with the production of 
butter, and the yardstick has not obstructed improvement in 
the manufacture of cotton or other goods, so methods of 
teaching, it may be assumed, * will improve and develop 
freely, even when fixed standards are applied.' The spirit 
can still go where it listeth. Measurement must meekly 
follow, gather up the results, and give them a value." 

Weights and measures call to mind definite units, such as 
pound, quart, and yard, and these are infinitely more valuable 
for commercial purposes than " as much as a man can lift," 
"a small jar full," or "the length of a man's arm." Stand- 
ards have made commercial transactions possible at great 
distances on a basis of perfect understanding and fairness. 
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• 

There is no doubt that teaching and the products of school 
work are going to be benefited in a similar manner by the 
application of definite standards of measurement. Measure- 
ment is always taking place in one form or another. School 
work is being constantly noted as goodj fair, or poor, 
as satisfactory or unsatisfactory , and is constantly being 
rated by such standards as are available, be these standards 
crude or otherwise. 

Many large cities have established bureaus of measurement 
and efficiency. Each bureau has a head with an adequate 
clerical staff. Such an organization is needed in a large city 
even when the teachers administer and grade the tests. A 
central bureau can establish city standards, make valuable 
comparisons, and interpret results in a way to be most val- 
uable and helpful to all teachers as well as superintendents 
and supervisors. But more and more the directors of central 
bureaus realize that they are failing unless they reach the 
individual teachers. Dr. Ballou emphasizes this on every 
page of his recent bulletin interpreting results in arithmetic.^ 
He assures us that in the last analysis " the teacher must 
find out what her trouble is and then apply the remedy." 

The present work makes no effort to discuss the complete 
list of available tests, but instead is limited to such tests as 
have been standardized sufficiently to recommend their use 
to the teacher who, for the most part, is imtrained in the use 
of statistical methods. In beginning the work in measurement, 
teachers should make no effort to employ all available tests, 
but should carefully select the test to be given. As pointed 
out by Ballou, teachers will do well to give tests that are 
reasonably simple, that can be scored and tabulated with 
reasonable ease, and that have been given to a suflScient 
number of children so that well-founded standards of achieve- 
ment have been estabUshed, the first assumption always being 
that the test measures desirable phases of school products. 

^ Boston, Educational Bulletin No. XIII. 
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APPENDIX 

Data for Estimating the Degree of Difficulty Required to 
Produce 20 Per Cent of Wrong or Omitted Responses When 
A Given Step of the Scale Produces from 8 Per Cent to 
40 Per Cent of Such. They are Used in Connection with 

THE THORNDIKE READING TeSTS FOR THE DETERlilNATION OF 

Pupils* Scores. 
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Haggerty and Noonan achievement ex- 
amination in reading: Sigma i, 153; 
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Ayres scale, 24, 28-35 ; what to measure 
in, 24 ; giving the test in, 25-27 ; scor- 
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56; bibliography, 56-57. 
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of English composition, 156. 

History, measurement of, 192-202 ; Bell 
and McCullum test in^ 193-195 ,* Van 
Wagenen history scales, 196-201 ; diag- 
nostic tests in, 201-202; bibliography, 
209-210. 
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234; differences among children, 233; 
Trabue language scales, 235-241; Otis 
group scale, 241-245; Haggerty's 
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test, Stanford revision, 250-252; bibliog- 
raphy, 252-253. 
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ports, 41. 
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16-18. 

Kansas silent reading test, 149; standard 
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in, 235-241. 
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Mode, 261. 

Monroe, Walter S., standardized silent 
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description of, 112; giving test, 114; 
interpreting and using results, 114; 
class record sheet, 115 ; standard scores, 
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Music, Seashore tests in, 209. 
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Quartile, 261. 
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Rice, J. M., referred to, i, 20; spelling 
test, 13-16. 
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matics, 221-222. 

Rugg and Clark, algebra test, 215-220. 
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19-21 ; standard, 271-274. 

Seashore, C. E., music tests, 209. 

Silent reading test, 152. 

Smith, William Hawley, referred to, 20. 
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scale, 6-10; giving a test, 7; scoring 
the papers, 8; distributing scores, 9; 
Buckingham's extension of Ayres scale, 
10; Iowa spdling scale, 11 ; Buckingham 
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Statistical methods, securing comparable 
results, 254-255; using a standardized 
test, 255-259; selecting the test, 256; 
scoring the papers, 257; tabulating 
results, 257; statistical terms, 25^262; 
standard tests, 268-274; applied to 
human product, 275-276. 

Stockard and Bell, geometry tests, 220-221. 

Stone, C. W., reasoning tests in arithmetic, 

75-78- 
Surveys. See Butte, Cleveland, Grand 
Rapids, Salt Lake City, Springfield. 

Table of frequency, 260. 



ment examination in, 153-154; biWl^ "teachers' composition scale, 172-178; 
ography, 154-155. ^ Gasy composition scale as plan, 172; 



need of, 172 ; value to teachers, 178. 
Thomdike, E. L., referred to, i, 24, 269; 

handwriting scale, 24, 53-55 ; reading 

scales, 1 19-126, 152 ; extension of Hille- 

gas composition scale, 178; drawing, 

scale, 181-186. 
Trabue, M. R., Nassau County supplement 

to the Hillegas scale, 157-166 ; aim, 157 ; 

description of, 157 ; applying scale, 161 ; 

scoring results, 161 ; interpreting and 

using results, 162 ; class record sheet, 

162. 

Visual vocabulary test, Part I, 143-148; 
aim, 143; description of, 143; giving 
test, 144; scoring results, 144; inter- 
preting results, 145; using results, 146; 
corrective measures, 148. 

Visual vocabulary test, Part II, 151. 

Whipple group tests for grammar grades, 
249. 



16-18 ; pupU's list of misspelled words. Willing scale for measurmg written com- 
position, 166-172 ; aim, 166; description 
and application of, 166 ; scoring results, 
166; class record sheet, 167 ; interpret- 
ing and using results, 168; class scores, 
1 68 ; scoring errors for general merit, 169. 

standards in handwriting, 39, 40; withers, John W., referred to, 2. 

punctuation scale, 179; geography test, Woody, Clifford, arithmetic scales, 78-91. 



18; uses of spelling scale, 19-21; 

methods of teaching, 21 ; bibliography, 

22. 
Springfield (111.) survey, 48. 
Starch, Daniel, spelling test, 15-16; 
m 



202; language tests, 207-208; physics 
tttst, 223; quoted, 266. 



Zirbes, Laura, referred to, 270. 



Printed in the United States of America. 
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